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A Corporate Dedication to 
Quality and Reliability 


National Semiconductor is an industry leader in the 
manufacture of high quality, high reliability integrated 
circuits. We have been the leading proponent of driv- 
ing down IC defects and extending product lifetimes. 
From raw material through product design, manufac- 
turing and shipping, our quality and reliability is second 
to none. : 

We are proud of our success . .. it sets a standard for 
others to achieve. Yet, our quest for perfection is on- 
going so that you, our customer, can continue to rely 
on National Semiconductor Corporation to produce 
high quality products for your design systems. 


A hie 


Charles E. Sporck 
President, Chief Executive Officer 
National Semiconductor Corporation 


Wir fihlen uns zu Qualitat und 
Zuverlassigkeit verpflichtet 


National Semiconductor Corporation ist fuhrend bei der Her- 
stellung von integrierten Schaltungen hoher Qualitat und 
hoher Zuverlassigkeit. National Semiconductor war schon 
immer Vorreiter, wenn es galt, die Zahl von IC Ausfallen zu 
verringern und die Lebensdauern von Produkten zu verbes- 
sern. Vom Rohmaterial Gber Entwurf und Herstellung bis zur 
Auslieferung, die Qualitat und die Zuverlassigkeit der Pro- 
dukte von National Semiconductor sind undbertroffen. 


' Wir sind stolz auf unseren Erfolg, der Standards setzt, die 
fiir andere erstrebenswert sind. Auch ihre Anspriche steig- 


enstandig. Sie als unser Kunde k6nnen sich auch weiterhin _ 


auf National Semiconductor verlassen. 


La Qualite et La Fiabilite: 


Une Vocation Commune Chez National 
Semiconductor Corporation 


National Semiconductor Corporation est un des leaders in- 
dustriels qui fabrique des circuits intégrés d’une trés grande 
qualité et d’une fiabilité exceptionelle. National a été le pre- 
mier a vouloir faire chuter le nombre de circuits intégrés 
défectueux et a augmenter la durée de vie des produits. 
Depuis les matiéres premiéres, en passant par la concep- 
tion du produit sa fabrication et son expédition, partout la 
qualité et la fiabilité chez National sont sans équivalents. 


Nous sommes fiers de notre succés et le standard ainsi . 


défini devrait devenir l’objectif a atteindre par les autres so- 
ciétés. Et nous continuons a vouloir faire progresser notre 
recherche de la perfection; il en résulte que vous, qui étes 
notre client, pouvez toujours faire confiance a National 
Semiconductor Corporation, en produisant des systemes 
d’une trés grande qualité standard. 


Un Impegno Societario di Qualita e 
Affidabilita 


National Semiconductor Corporation 6 un’industria al ver- 
tice nella costruzione di circuiti integrati di alta qualita ed 
affidabilita. National 6 stata il principale promotore per |’ab- 
battimento della difettosita dei circuiti integrati e per l’allun- 
gamento della vita dei prodotti. Dal materiale grezzo attra- 
verso tutte le fasi di progettazione, costruzione e spedi- 
zione, la qualita e affidabilita National non 6 seconda a nes- 
suno. 


Noi siamo orgogliosi del nostro successo che fissa per gli 
altri un traguardo da raggiungere. Il nostro desiderio di per- 
fezione é d’altra parte illimitato e pertanto tu, nostro cliente, 
puoi continuare ad affidarti a National Semiconductor Cor- 
poration per la produzione dei tuoi sistemi con elevati livelli 
di qualita. 


A Lind 


Charles E. Sporck 


President, Chief Executive Officer 
National Semiconductor Corporation 
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LIFE SUPPORT POLICY 


NATIONAL’S PRODUCTS ARE NOT AUTHORIZED FOR USE AS CRITICAL COMPONENTS IN LIFE SUPPORT DEVICES OR 
SYSTEMS WITHOUT THE EXPRESS WRITTEN APPROVAL OF THE PRESIDENT OF NATIONAL SEMICONDUCTOR COR- 


PORATION. As used herein: 


—_s 


. Life support devices or systems are devices or systems 
which, (a) are intended for surgical implant into the body, 
or (b) support or sustain life, and whose failure to per- 
form, when properly used in accordance with instructions 
for use provided in the labeling, can be reasonably ex- 
pected to result in a significant injury to the user. 


2. A critical component is any component of a life support 
device or system whose failure to perform can be reason- 
ably expected to cause the failure of the life support de- 
vice or system, or to affect its safety or effectiveness. 


National Semiconductor Corporation 2900 Semiconductor Drive, P.O. Box 58090, Santa Clara, California 95052-8090 (408) 721-5000 


TWX (910) 339-9240 


National does not assume any responsibility for use of any circuitry described, no circuit patent licenses are implied, and National reserves the right, at any time 


without notice, to change said circuitry or specifications. 
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Introduction 


National Semiconductor’s graphics and imaging products in- 
clude complex VLSI peripheral circuits designed to serve a 
variety of applications. These products are especially well 
suited for microcomputer and microprocessor systems such 
as workstations, terminals, personal computers, and many 
others. 


GRAPHICS 


Sophisticated human interface is the mark of the newest 
computer systems. Today’s personal computer may have 
better graphics display capability than engineering worksta- 
tions of a few years ago. National Semiconductor has devel- 
oped a new family of Advanced Graphics products to pro- 
vide extremely high performance, high resolution mono- 
chrome or color graphics displays. The graphics chip set is 
designed to provide the highest level of performance with- 
out placing constraints on the overall system design or per- 
formance. Flexibility is as important as is overall perform- 

‘ance. That flexibility is provided by the partitioned function- 
ality, modular building block approach, open architecture, 
programmability of all components and the ability to address 
the frame buffer in a planar (parallel) mode or in a pixel-wise 
mode. The graphics system may be expanded to any num- 
ber of color planes with virtually unlimited resolution. The 
Graphics Databook provides all of the details to make dis- 
play system design easy. 
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Advanced Peripherals 
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ne Advanced Peripherals 


Ethernet 


DRAM 
Management 


‘National Semiconductor Advanced Peripherals products in- 


clude complex VLSI peripheral circuits designed to serve a 
variety of applications. The Advanced Peripherals products 
are especially well suited for microcomputer and microproc- 
essor systems such as graphics workstations, personal 
computers, and many others. National Semiconductor Ad- 
vanced Peripherals devices are fully described in a series of 
databooks and handbooks. 


Among the Advanced Peripherals books are the following 
titles: 


MASS STORAGE 


The National Semiconductor family of mass storage inter- 
face products offers the industry’s highest performance and 
broadest range of products for Winchester hard disks and 
floppy disks. The Mass Storage Handbook includes com- 
plete product information and datasheets as well as a com- 
prehensive design guide for disk controller systems. 


DRAM MANAGEMENT 


Today’s large Dynamic Random Access Memory (DRAM) 
arrays require sophisticated high performance devices to 
provide timing access arbitration on board drive and control. 
National Semiconductor offers the broadest range of DRAM 
controllers with the highest “No-waitstate” performance 
available on the market. Controllers are available in Junc- 
tion Isolated LS, Oxide Isolated ALS, and double metal 
CMOS for DRAMs from 64k bit through 4M bit devices, sup- 
porting memory arrays up to 64 Mbyte in size with only one 
LSI/VLSI device. For critical applications, National Semi- 
conductor has developed several 16- and 32-bit Error 
Checking and Correction (ECC) devices to provide maxi- 
mum data integrity. The Memory Support Handbook con- 
tains complete product information and several application 


_hotes detailing complete memory system design. 


LOCAL AREA NETWORKS AND DATA 
COMMUNICATIONS 

Today’s computer systems have created a huge demand for 
data communications and Local Area Networks (LANs). 


Real Time Clocks 


-——F 3270 Protocol 


vi 


TL/XX/0058-1 


National Semiconductor provides a complete three-chip so- 
lution for an entire IEEE 802.3 standard for Ethernet/ 
Cheapernet LANs. National Semiconductor offers a com- 
pletely integrated solution for the IBM 370 Class main- 
frames, System 3X and AS/400 systems for physical layer 
front end and processing of the IBM 3270/3299 “‘coaxial’’ 
and 5250 “‘twinaxial” protocols. To drive the communica- 
tions lines, National Semiconductor has drivers and receiv- 
ers designed to meet all the major standards such as RS- 
232, RS-422, and RS-485. Datasheets and applications in- 
formation for all these products are in the LAN/DATA 
COMM Handbook. 


GRAPHICS 


Sophisticated human interface is a mark of the newest com- 
puter systems designs. Today’s personal computer may 
have better graphics display capability than engineering 
workstations of a few years ago. National Semiconductor 
has developed a new family of Advanced Graphics products 
to provide extremely high performance, high resolution color 
graphics displays. The graphics chip set is designed to pro- 
vide the highest level of performance with minimum de- 
mands and loading on the system CPU. The graphics sys- 
tem may be expanded to any number of color planes with 
virtually unlimited resolution. The Graphics Databook lays it 
all out and makes the display system design easy. 


REAL TIME CLOCKS 


National offers a family of Real Time Clocks (RTCs) and 
advanced Timer Clock Peripherals (TCPs). The RTC family 
provides a simple «P bus compatible interface to any sys- 
tem requiring accurate, reliable, on-going real time and cal- 
ender functions. The TCP family offers the RTC, RAM and 
two 16-bit programmable timers with fast ~P bus handshake 
controls for chip select, read and write. The Real Time 
Clock handbook includes complete product information and 
datasheets as well as applications information. 
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Product Status Definitions 
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Definition of Terms 


Data Sheet Identification Product Status Definition 


Formative or _ This data sheet contains the design specifications for product 
In Design development. Specifications may change in any manner without notice. 


First This data sheet contains preliminary data, and supplementary data will 
Production be published at a later date. National Semiconductor Corporation 
reserves the right to make changes at any time without notice in order 
to improve design and supply the best possible product. 


Full This data sheet contains final specifications. National Semiconductor 
Production Corporation reserves the right to make changes at any time without 
notice in order to improve design and supply the best possible product. 


National Semiconductor Corporation reserves the right to make changes without further notice to any products herein to 
improve reliability, function or design. National does not assume any liability arising out of the application or use of any product 
or circuit described herein; neither does it convey any license under its patent rights, nor the rights of others. 
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DP8500 Raster Graphics Processor 


General Description 


National’s DP8500 Raster Graphics Processor (RGP) is a 
microprocessor specifically tuned for graphics applications. 
A member of the Advanced Graphics Chip Set, it provides 
the set of functions required for display buffer update and 
video refresh in mid-to-high-performance color or mono- 
chrome raster graphics systems employing both graphics 
primitives and text. The RGP combines the following ele- 
ments: a general-purpose microcoded microprocessor, a 
programmable video refresh generator, a vector generator, 
a BITBLT controller and a rectangular clipper. As such, it 
may be used in standalone applications or as a dedicated 
graphics engine in conjunction with any general-purpose mi- 
croprocessor. 


The RGP supports a system architecture that features con- 
stant drawing speed, measured in pixels/second, indepen- 
dent of the depth (number of bits) of the pixel. This key 
feature arises from the RGP’s use of an external data path 
device, the BITBLT Processing Unit (BPU), for all drawing 
functions. By employing a BPU on each plane of memory, 
the traditional “bottleneck”’ is removed from the data manip- 
ulation path. In effect, the data bus width (for drawing pur- 
poses) is made proportional to the pixel depth, thus preserv- 
ing the drawing speed as pixel depth is increased from one 
bit to any number of bits. 


During video refresh, the RGP produces all synchronization 
and blanking signals for CRT displays and generates memo- 
ry cycles, appropriate for the type of memory used, on be- 
half of the video shift registers. Any type of memory may be 
used: SRAM, DRAM or video RAM. In addition, the RGP 
supports the use of fast access modes in dynamic RAMS, 
such as page mode or static column mode. 


Block Diagram 


Features 
m@ 20 MHz operation 
m@ Fully programmable | 
m Large, uniform address space 
— 28-bit bit (pixel) address 
— 24-bit word address 
— 16-bit data bus 
— Program, data, and display memories can reside 
anywhere . 
m@ Flexible bus interface | 
— Processor independent 
— Conventional HOLD/HLDA mechanism 
m Large drawing space 
—— Up to 16384 by 16384 pixels per bitmap 
— Pixels of any depth 
m Dedicated graphics hardware 
— Vector generator 
— Line pattern generator 
— BITBLT controller 
— Rectangular clipper 
@ Efficient text support 
— Multiple fonts/sizes 
— Proportional spacing 
@ Programmable video refresh 
— Can be disabled for laser printer applications 
— Pixel rates to 250 MHz and beyond 
— Display formats to 65536 pixels by 4096 scan lines 
—- Interlaced or non-interlaced 
— Genlock support 
@ microCMOS technology 
m 68-lead PLCC package 


Address/Data Bus Control Status 


instruction 
Decode 
and Control 


Interrupts 
Reset 


_ Line, BLT 
4 Decode and 
Control 


BPU Control 


Pixel 
Clipper 


Refresh and 
Monitor Control 


TL/F/9427-1 


DP8&500 


Connection Diagram 


Piastic Chip Carrier | 
moat ~m no 3 Yo 8 
eS58552522585822258 


AI6 > ADS, 

A17 = ADA 
GNDB2 “= LCLK 

A18 » AD3 

A19  AD2 

azo vecs4 

RST ADI 

A24 | GNDB6 

int ~. DP8500 RGP. Reserved 
GNDB3 ADO 

A22 Reserved 

NMI BLANK 

A23 HSYNC 

PH2 VSYNC 
vCCB2 GNDBS. 

HALT BSE 
BRREO —/6 

29 30 31 : 
BAR®E SB aRga gee Ra 
«lr Zz = 6 °$ 9 | 
. : TL/F/9427-2 
| Order Number DP8500V . 


See NS Package Number V68A 


Pin Descriptions 


| Pin Description a 


SUPPLIES 


GNDL1- 


GNDL2 
Positive supply for on-chip buffers: 


VCCB1- 

VCCB4 | 5Vdc + 10%. | | 
GNDB1- | Ground for on-chip buffers. | | 
GNDB7 | 


INPUTS 
PH1 


Phase 1. PH1 is an MOS-level clock normally © 
provided by the VCG. It must have no overlap with 
the PH2 clock. . 


Phase 2. PH2 is an MOS-level clock normally — 
provided by the VCG. It must have no overlap with 
the PH1 clock. eas 


- ‘Description 


INPUTS (Continued) 


Load Clock. LCLK is a TTL-compatible clock 
normally supplied by the VCG’s LCLKO. It provides 
the basic time unit used in the registers that 
specify digitally the video refresh functions: sync, 
blanking and display-refresh bus cycles. 


Reset In (active low) RSTI is used to place the 
RGP into the reset state; this is typically done at 
| power-up. 


Wait (active low). WAIT is used to cause the RGP 
to insert one or more wait states into the current 
bus cycle. This mechanism can be used to 
accommodate the RGP to relatively slow memory 
devices; it can also serve as a bus-not-ready 

_ indication from a bus arbiter. 


Pin Descriptions (Continued) 


| Pin | Description | 
INPUTS (Continued) 
HOLD 


0 Hold Request (active low). HOLD serves as a 
request from another master for the RGP’s 
bus. In response, the RGP will complete the 
current bus cycle (if any), TRI-STATE® only 
the address and data buses and assert Hold 
Acknowledge. 
enabled upon the first store operation to the 
PSR register. 
RESERVED | Reserved for NSC testing. Must be tied to 
ground. 
OUTPUTS 
A16-A23 Address Lines. A16-—A23 provide the most 
significant eight bits of memory addresses 
during bus cycles. A23 is the most significant. 
BSO-BS1 


Interrupt Request (active low). INT causes the 
RGP to suspend normal processing after 
completion of the current instruction (if any), to 
save the Program Counter and Processor 
Status Register on the stack and to enter the 
user's interrupt service routine. This function 
can be disabled via the Processor Status 
Register. 


Non-Maskable Interrupt Request (negative 
edge triggered). NMI causes the RGP to 
suspend normal processing after the 
completion of the current instruction (if any), to 
save the Program Counter and Processor 
Status Register on the stack and to enter the 
user’s non-maskable interrupt service routine. 
This function is disabled upon Reset, and is 


A16-A23 are at TRI-STATE whenever HLDA 
is asserted by the RGP. Addresses are 
guaranteed to be valid at the falling edge of 
ALE. 


Bus Status Lines. BSO-BS1 are used to 
indicate the type of bus cycle to be performed 
by the RGP. (See Table |.) BSO—-BS1 become 
valid during the first T-STATE of a bus cycle 
(before the rising edge of ALE) and remain 
valid through the end of the last T-state of that 
cycle. 


Read (active low status). RD indicates that 
the current cycle will cause memory to be 
read. RD becomes valid during the first T- 
state of a bus cycle (before the rising edge 
of ALE) and remains valid through the end of 
the last T-state of that cycle. Both RD and 
WR may be asserted in the same cycle. This 
indicates that the RGP is requesting a read- 
modify-write operation in the current cycle. 


| Pin | Description 
OUTPUTS (Continued) 


Write (active low status). WR indicates that the 
current cycle will cause memory to be written. 
WR becomes valid during the first T-state of a 
bus cycle (before the rising edge of ALE) and 
remains valid through the end of the last T-state 


of that cycle. Both WR and RD may be asserted 
in the same cycle. This indicates that the RGP is 
requesting a read-modify-write operation in the 

current cycle. 


Address Latch Enable. ALE indicates the 
beginning of a bus cycle. Its rising edge 
indicates that bus status, consisting of BS1-0, 
RD and WR are valid. Its falling edge indicates 


that the PB and the address, consisting of ADO- 
15, A16—23 and (if used in this cycle) BO-3, are 
valid. 


Page Break (active low status). PB indicates that 


W 

A 
the 16 most significant address bits (AD8-15 
and A16—23) of the RGP’s current bus cycle are 
not equal to the 16 most significant address bits 
generated by the RGP in the previous bus cycle. 
PB can be used by the system’s memory 
controller to allow it to generate page-mode 
accesses to memory, resulting in shorter access 
times. Since the eight least significant bits of 
addresses are ignored in the above comparison, 


the page size is always 256 words. 


R 
LE 
HLDA | Hold Acknowledge (active low). HLDA indicates 
that the RGP has put the address and data 
buses at TRI-STATE and has entered an 
internal hold condition. The RGP will not exit the 
hold condition until the HOLD input has been 
ANK 


removed. 


g 
-< 
Zz 


Horizontal Synchronization (active low). HSYNC 


indicates to the CRT monitor that the horizontal 
sweep should begin its retrace. The RGP can be 
programmed to use this output as composite 
syne, that is, the exclusive-NOR of HSYNC and 


VSYNC. This option is selected via the RGP’s 
Video Control Register. 


Composite Blanking (active low). BLANK is the 
result of ORing the internal vertical and 
horizontal blanking signals. BLANK indicates to 
the CRT monitor that the screen should be 
blanked. 


Display Refresh Request (active low). DRREQ 
starts generation of a Video Refresh bus cycle. 
DRREQ can be used by an external bus arbiter 
as a high-priority bus request. It can also be 
used for precise control of video DRAM transfer 
cycles, e.g., when performing mid-scanline 
transfers. 


| 


DRREQ 
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Pin Descriptions (Continued) 


| Pin | Description 


| OUTPUTS (Continued) 


Linedraw/BITBLT mode. L/B sets the operating 
- | mode of the BPU during drawing operations. 
L/B is high to place the BPU into line-drawing 
| mode, low to place the BPU into BITBLT mode. 
L/B is asserted throughout execution of any 
_ | drawing instruction that uses the BPU, starting 

| prior to the generation of any bus cycles within 


that instruction. 
Z BITBLT Source Enable. BSE directs the flow of 
BO/LME | 


data within the BPU. When high, BSE indicates 
to the BPU that the current bus cycle is 
associated with BITBLT data in the source 
rectangle. BSE is low during BITBLT destination 
data bus cycles and during non-BITBLT cycles. 
It is valid on the rising edge of ALE and remains 
valid throughout the bus cycle. 


Bit Select 0 or Left Mask Enable. This output to 
the BPU serves two purposes. When L/B is high 
it functions as BO (see below). When L/B is low 
it functions as LME, which serves to enable the 
left mask. LME is valid at the falling edge of ALE 
and remains valid throughout the bus cycle. 


Bit Select 1 or Right Mask Enable. This output to 
the BPU serves two purposes. When L/B is high 
it functions as B1 (see below). When L/B is low 
it functions as RME, which serves to enable the 
right mask. RME is valid at the falling edge of 
ALE and remains valid throughout the bus cycle. 


Bit Select 2 or FIFO Write. This output to the 
BPU serves two purposes. When L/B is high it 
functions as B2 (see below). When L/B is low it 
functions as FWR, which causes the BPU’s 
barrel-shifter output to be written to the BPU’s 
FIFO. FWR is valid on the rising edge of PH1, 
two clock periods after WAIT is sampled high 

| during the next T2-state bus cycle. 


Bit Select 3 or FIFO Read. This output to the 
BPU serves two purposes. When L/B is high it 
functions as B3 (see below). When L/B is low it 
functions as FRD, which causes the BPU’s FIFO 
output to be read into the BPU’s logic unit. FRD 
is valid at the rising edge of a PH1 during the 
fetch of the corresponding destination data 
word. 


Bit Select. When L/B is low, these four outputs 
have other functions (see above). When L/B is 
high, BO-3 select a specific bit within the word 
addressed by ADO-15 and A16-23. BO-3 
become valid prior to the falling edge of ALE 
and remain valid throughout the bus cycle. 


| Pin | Description 
QUTPUTS (Continued) - a a | 


HALT | Halt (active low). HALT indicates that the RGP - 
has executed a HALT instruction and entered 

| the halt state, a state in which no instructions 

are processed, but video refresh functions 

continue. The halt state can be exited via an 

interrupt or reset. 


Reset Out (active low). RSTO is driven low 
whenever RST1 is driven low. It is also driven low 
| (for two clock cycles) at the end of execution of 
FILLA or FILLT instructions. The INITB 
instruction drives RSTO low for two clocks. 
RSTO is. normally connected to the BPU’s 
RESET input. aaa 


INPUT/OUTPUTS 


Multiplexed Address and Data lines. ADO-—15 
serve as outputs early in a bus cycle, providing 
the 16 least significant memory address bits. 
Later in the bus cycle they serve as the data 
bus. ADO is the least significant bit of data or 
address. As a data bus, these lines can be 
inputs (during reads), outputs (during writes) or 
can be ignored by the RGP (e.g., during 
BITBLTs). These lines are at TRI-STATE 
whenever Hold Acknowledge is asserted by the 
RGP. Addresses are guaranteed to be valid at | 
the falling edge of ALE. 


Vertical Synchronization (active low). VSYNC 
can serve as an input or an output. As an output, 
it indicates to the CRT monitor that the vertical 
sweep should begin its retrace. As an input, it 
clears the internal counters associated with 
vertical sync generation within the RGP. As 
such, it allows the RGP to synchronize itself with 
an external video source (this assumes the use 
of horizontal synchronization features of the 
VCG). This option is selected via the RGP’s 
Video Control Register. 


Tabie | 


(BS1, BSO) (Function) 
| 0 | __OperandReador Write 


Architectural Description 


AGCS OVERVIEW 


The RGP, serving as the core of a bitmapped graphics sys- 
tem, is designed to work in concert with the other members 
of the Advanced Graphics Chip Set (AGCS). Other compo- 
nents of the chip set include the DP8512 Video Clock Gen- 
erator (VCG), the DP8515/16 Video Shift Register (VSR) 
and the DP8511 BITBLT Processing Unit (BPU). Additional 
functions required for system implementation are provided 
by National through such components as the DP8520/22 
Video DRAM Controller and the Bus State Machine (BSM). 


The components that comprise the Advanced Graphics 
Chip Set are fabricated in a variety of technologies, each 
appropriate to the function performed by that component. 
Both CMOS and bipolar technologies are used in the family. 
As a result, the family exhibits both VLSI functionality and 
250 MHz operation. 


The Advanced Graphics Chip Set supports a high-perform- 
ance architecture without imposing a particular bus proto- 
col, timing or memory type upon the system designer. As a 
result, AGCS-based systems can be realized with a range of 
solutions to the cost/performance tradeoff. Advances in 
memory technology can be capitalized upon in future sys- 
tems while retaining software compatibility. 
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Drawing | 
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A major feature of the system architecture is the support of 
simultaneous data manipulation (during drawing) at each of 
the bitplanes. This allows the system to retain its drawing 
speed (in pixels per second) as pixel depth increases from 
one to any number of bits. 


The functional interconnection of these components is illus- 
trated in Figure 71, which represents a minimal bitmapped 
graphics system. In this system, the intensity of each pixel is 
described by a single bit, i.e., each pixel is either on or off. 
The roles of the various components are described below, 
as are design considerations for color systems. 


DP8500 Raster Graphics Processor (RGP) 


The RGP is designed to be the overall control mechanism in 
graphics systems. It draws graphics objects in the display 
buffer, refreshes the video display, and performs general 
purpose computing tasks. 

The implementation of the RGP reflects these functions 
(see Figure 2). A general purpose microcoded microproces- 
sor core is augmented by dedicated hardware for the setup 
and execution of graphics primitives. In addition, a program- 
mable state machine handles all video synchronization 
functions and produces addresses for video refresh. Inter- 
nal bus arbitration logic controls access to the external bus. 
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FIGURE 2. RGP Organization Diagram 
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Architectural Description (continued) 


DP8511 BITBLT Processing Unit (BPU) 


The DP8511 BPU provides a data path for changing the 
contents of the frame buffer. The drawing functions of the 
RGP, e.g., line drawing and polygon filling, rely on the BPU 
to accept data from memory, modify it, and return it to mem- 
ory. The general scheme underlying these operations is 
BITBLT (BIT boundary BLock Transfer). During BITBLT, the 
RGP serves as an address generator to initiate the appropri- 
ate memory cycles and as a strobe generator to control the 
BPU’s operation. 


The BPU is a microCMOS device intended to provide hard- 
ware support for BITBLT. It includes a barrel shifter, used to 
shift source data into bit alignment with the destination and 
a FIFO to contain a series of shifted source words. A 16- 
function logic unit allows all possible bitwise combinations 
between the source and destination data. The BPU has 
hardware to support pixel operations and line drawing. In 
particular, it can address a single bit within the current word 
and read and write the selected bit. Details of BPU opera- 
tion can be obtained from the BPU data sheet. 


The use of BITBLT as the underlying mechanism for all 
drawing operations promotes a common method for han- 
dling both simple systems, as shown in Figure 7, and more 
complex, high performance systems, as shown in Figure 3. 
This latter figure illustrates a multi-plane system, i.e., one in 
which several bits (one from each plane) describe the value 
of a pixel. This might be used to describe the intensity (in 
monochrome) or the hue (in color) of the pixel. The use of a 
BPU per plane, permitting simultaneous update of all 
planes, leads to the highest performance. Alternatively, a 
cost-performance trade-off might be made by sharing one 


or more BPUs across two or more planes. At one extreme 


(single BPU) updates are done serially to each plane. At the 
other extreme (a BPU per plane) updates are performed ina 
purely parallel manner. A middie ground could be imple- 
mented, updating the planes in a serial-parallel manner. The 
mechanism of implementing the data path function in the 
BPU, rather than the RGP, produces this design flexibility. 


DP8515/16 Video Shift Register (VSR) 
The DP8515/16 VSR shifts pixel information at the data 


rate required by the CRT. It is implemented in National’s | 


Bipolar-CMOS process. It combines CMOS control logic 
and a CMOS input FIFO with an ECL shifter. As a result, it 
provides the system designer additional timing flexibility in 
the load path coupled with up to 350 MHz shift capability 
without excessive current consumption. The DP8515 pro- 
vides ECL 10k compatible outputs, while the DP8516 is ECL 
100k compatible. Parallel loading of the VSR is initiated by 
the RGP and assisted by the VCG. | 


DP8512 Video Clock Generator (VCG) 


The DP8512 VCG, implemented in National’s oxide-isolated 
bipolar process, provides all clocks in the system. It gener- 
ates all clocks from a relatively low frequency (less than 
20 MHz) crystal or external clock, simplifying system design 
and reducing system cost. 


A two-phase MOS processor clock is supplied to the RGP 
and BPU; TTL clocks are generated for the RGP video re- 
fresh logic and for the VSR’s FIFO control functions. The 
VSR’s load and shift functions are controlled by ECL clocks 
generated by the VCG. An on-chip phase-locked loop (PLL) 
multiplies the reference crystal/clock in order to generate a 
pixel clock as high as 225 MHz. The VCG also includes 
another PLL for synchronizing the horizontal Byne (generat- 
ed by the RGP) to an external source. 


PROGRAMMING MODEL 


The RGP is a microprocessor combined with a concurrent 
video-refresh machine. This section will discuss the pro- 
grammer-visible aspects of both the microprocessor and 


‘the video-refresh machine. For additional information, refer 


to the DP8500 RGP Programmer’s Reference Manual 


~ (PRM). 


The processor section of the RGP is a general purpose mi- 
croprocessor with an instruction set expanded to include 


graphics operations. While the processor is microcode driv- 


en, certain graphics operations, notably BITBLT, line draw- 
ing and clipping, are implemented via dedicated hardware 
for increased throughput. 


The organization of the processor, shown in Figure 4, re- 
flects the duality, present in raster-graphics systems, of 
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FIGURE 3. High-End AGCS-Based Color Bit-Mapped Graphics System 
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Architectural Description (continued) 


Data 
Processor 


a . TL/F/9427-6 
FIGURE 4. RGP Dual-Processor Configuration 


a virtual (Cartesian, x-y) drawing space and a physical 


(memory address) space. That is, the RGP’s processor sec- — 
tion actually consists of two processors, the Address Proc- — 


essor (AP) and the Data Processor (DP), operating concur- 
rently, driven by common microcode. The RGP’s register 
complement, described below, is shown in Figure 5. 


The AP consists of a 28-bit Arithmetic and Logic Unit (ALU), .- 


having a relatively simple instruction set, and a private bank 
of sixteen 28-bit registers. The DP is composed of a 16-bit 
ALU, having a relatively rich instruction set, and a bank of 
sixteen 16-bit registers. These instruction sets operate reg- 
ister-to-register only and then only within their respective 
register banks. _ 


The remaining registers of the RGP have dedicated func- 
tions in support of the graphics environment or the video 
refresh mechanism. Examples include the registers of the 
Clipper and the Display Buffer Base Address register (DBB). 


A single stream of instructions, fetched from external mem- 
ory, serves both processors via microcode control. This in- 
cludes the register-to-register instructions of both the AP 
and DP and the instructions for data transfers between RGP 
registers and memory. Additional RGP instructions, not be- 
longing exclusively to the AP or DP, use both processors 
and, often, additional on-chip resources; this is typical of all 
of the drawing instructions. An example of this is the DRLN 
(draw line) instruction, which uses both processors, the clip- 
per, the line drawing controller, the line-pattern generator 
and the BITBLT controller. © _ 


While simple instructions like ADD and MOV leave unmodi- 
fied any registers not specified in their encodings as being 
operands, the more complex drawing instructions like DRLN 
use certain registers of the AP and DP as implicit arguments 
or as temporary storage. These side effects of RGP drawing 
instructions are detailed in the Programmer’s Reference 
Manual PRM. 


Memory Organization and Data Types 


The RGP supports uniform: 24-bit addressing; the unit of 
storage in memory is the word. That is, the memory space 
of the RGP consists of 224 16-bit words. Parts of the memo- 
ry space can be designated by the user as Program, Data, 
Stack, Display Buffer, etc., at the user’s discretion; the RGP 
imposes no restrictions upon these allocations. 


For purposes of drawing, the RGP views the memory as 
being bit addressable (see Memory Addressing Section). In 
this case, the address is a 28-bit quantity, called a Bit Ad- 
dress, which consists of a 24-bit Word Address, left-shifted 
four bits and added to a 4-bit bit-selection field. 


Data elements treated by the RGP vary in length (number of 
bits) from 3 to 28 bits according to their function. However, 
they are always stored right justified (that is, justified toward 
bit 0) in registers or in memory (see Figure 6). Further, 
when stored in memory, multiple-word quantities are always 
stored with the least significant word at the lowest storage 
address. Supporting what is known as “‘Little-Endian’”’ archi- 
tecture. The address of a memory-resident quantity is the 
address of its least significant word. 


The following data element lengths are used by the RGP: 


word 16 bits or less 
word address 24 bits 
bit address 28 bits 


If an element is written from memory to a register that is 
shorter than the element, the more significant portion of the 
element is truncated to allow the less significant portion to 
fit into the register. If an element is written from a register to 
memory, the element is right justified in the memory location 
and unused bits are set to zero. 


Architectural Description (continued) 
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FIGURE 5. RGP Register Summary 
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28=BiT BIT ADDRESS 0 


IN A REGISTER: 


IN MEMORY: 


24=BIT WORD ADDRESS 


IN A REGISTER: 
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Architectural Description (continued) 


Address Processor Registers 


The registers of the Address Processor (Figure 7) are 28 bits in length. They are used to hold operands for the Address 
Processor or to hold Word Addresses or Bit Addresses. Registers AO, A1 and A2 are not subject to side effects of RG@P drawing 


instructions. 
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FIGURE 7. Address Processor Registers 


Data Processor Registers 


The registers of the Data Processor (Figure 8) are 16 bits in lenath. All of these registers are subject to side effects of RGP 
drawing instructions, but DO-5 are preserved through execution of all RGP instructions except those used for polygon filling. 


DSXT — RESERVED FOR USER® 
LOCTT ~ RESERVED FOR USER * 

D2 LLENT — RESERVED FOR USER® 

DS “LDEIT ~ RESERVED FOR USER * 

D4 LDE2T ~ RESERVED FOR USER” 

DS LERRT — RESERVED FOR USER® 

D6 FX/MICRO — RESERVED FOR POLGON FILL 

D7 FXT/MICRO ~ RESERVED FOR POLGON FILL 


= RESERVED FOR POLGON FILL 
_. = RESERVED FOR LINE LENGTH 

- BRESENHAM ERROR INCREMENT! 
= BRESENHAM ERROR INCREMENT2 
~ BRESENHAM ERROR 

= BITBLT WIDTH 


Di4 BHT/MICRO = BITBLT HEIGHT 
D15 MICRO = RESERVED FOR MICROCODE 


: : except during FILL instructions 
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FIGURE 8. Data Processor Registers 
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Architectural Description (Continued) 


Status/Control Registers 


The following registers indicate or control instruction or in- 
terrupt status of the RGP. They are each 16 bits in length. 


The Processor Status Register (PSR, see Figure 9) indi- 
cates the status returned as a result of instruction execu- 
tion. It also controls the enabling of maskable interrupts and 
the enabling of the clipper. The individual bits are defined as 
follows (all fields are asserted when set to one, negated 
when set to zero): 


Z: Data Processor Zero bit 
C: Data Processor Carry bit 
N: Data Processor Negative bit 
V: Data Processor Overflow bit 
AZ: Address Processor Zero bit 
Address Processor Carry bit 


K: An undrawn character code is present in register 
TNXC. 


W: The current x,y is within the bounds of the clipper. 


EIP: External Maskable Interrupt Pending 
VIP: Video Interrupt Pending 

CLE: Enable Clipper (1 to enable) 

PTE: Enable Pick Trap 

EIE: Enable External Maskable Interrupt 
VIE: Enable Video Interrupt 


See the PRM for more information 
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FIGURE 9. Processor Status Register 


Clipper Registers 
All drawing operations executed by the RGP are subject to 


the action of a rectangular clipper, when the clipper is en- . 


abled. The action of the clipper is to suppress the drawing of 
pixels outside its boundary. Clipping always takes place to 
bit resolution independently of the nature of the graphics 
primitive/operation. 


That is, all lines, polygons, BITBLTs and text primitives are 
clipped exactly to the coordinate values defined in the clip- 
per. 


The clipper consists of six 16-bit coordinate registers and a 
16-bit clipper status register, as follows: 


XMIN — The x-coordinate of the left boundary of the 
clipper 

XMAX__ The x-coordinate of the right boundary of the 
clipper ° 

YMIN _ The y-coordinate of the top boundary of the 
clipper 

YMAX_ The y-coordinate of the bottom boundary of the 
clipper 

DSX The x-coordinate of the current drawing point 

DSY The y-coordinate of the current drawing point 

CSR The 16-bit Clipper Status Register 


When enabled, the clipper is invoked by the RGP for all 
drawing instructions. The clipper may also be utilized by 
drawing routines written by the user. The organization of the 
CSR (Figure 10) facilitates this use of the clipper. The CSR 
contains the results of the four meaningful comparisons be- 
tween the clipper boundary registers and the current point 
registers. These results are expressed in ‘‘outcode” format 
in the four least significant bits of the CSR. 
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Architectural Description (Continued) 
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FIGURE 11. Line Style Generator 


Line Style Registers 


Those graphics primitives which are drawn by the RGP one 
pixel at a time, i.e., points, lines and polylines, are drawn 
subject to the action of a line pattern generator within the 
RGP. This mechanism produces a pattern along the length 
of a (presumed) line by suppressing the memory cycles as- 
sociated with drawing certain points along the line, accord- 
ing to values contained in the registers of the line pattern 
generator. The line pattern generator includes a mechanism 
for pattern magnification. The registers of the line pattern 
generator (Figure 17) are defined as follows: 


LPAT 16-bit Pattern Register 


LPATC 16-bit Pattern Control Register, contains three 
fields: 


LEN Pattern Length (4 bits) 
MAG Pattern Magnification (6 bits) 
CTR Pattern Counter (6 bits) 


During pointwise drawing operations, the RGP examines the 
least significant bit of the LPAT register to determine wheth- 
er or not to produce a memory cycle to draw the current 
point. A one in this position will enable drawing; a zero will 
inhibit drawing. At this point the memory cycle associated 
with drawing the current point will be executed if appropri- 
ate. 


The least significant bit of LPAT remains in place for the 
number of points specified in the MAG field of LPATC; then 
a circular right shift is performed on the n least significant 
bits of LPAT, where n is the value specified by the LEN field 
of LPATC. The CTR field of LPATC serves as a counter 
used by the RGP to implement magnification. See the PRM 
for more information. 


Since the length of the LPAT register is 16 bits, the maxi- 
mum pattern length (assuming a magnification of one, i.e., 
no magnification) is 16 bits. However, shorter patterns can 
be implemented by setting the LEN field to an appropriate 
value. The length of the pattern is the number of least signif- 
icant bits of LPAT that participate in the circular shift. LEN, 
MAG and CTR all have biases of + 1. That is, values of 0 in 
these fields will program the line pattern generator for a 
pattern of length 1 and a magnification of 1. 


The registers LPAT and LPATC are never reinitialized im- 
plicitly by RGP instructions. Once set, a pattern persists 
through all subsequent points, lines or polylines drawn. 
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Video Refresh Registers 

The video refresh registers (Figures 12 afd 13) contain the 
parameters for the programmable video refresh machine. 
They are of two types, distinguished by the mode of access- 
ing them. 

The first type is composed of eight registers that share a 
common register address (VIDEO); these can be accessed 
serially by successive MOV instructions. A 3-bit circular 
pointer (the VRX field of the VCR register) advances after 
each MOV and can be set to point to any of the eight regis- 
ters. | 

The second type consists of 3 registers with separate ad- 
dresses. The video refresh registers and their interpreta- 
tions are detailed below. The eight serially addressable reg- 
isters are shown first, in order. HSLT corresponds to a VRX 
value of 0, VBS to a value of 7. 
HSLT Horizontal Scan Line Time (12 bits) 

HSE Horizontal Sync End (12 bits). 

HBE Horizontal Blanking End (12 bits) 

HBS Horizontal Blanking Start (12 bits) 

VFT Vertical Frame Time (12 bits) 

VSE ‘Vertical Syne End (12 bits) 

VBE Vertical Blanking End (12 bits) 

VBS __ Vertical Blanking Start (12 bits) 
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FIGURE 12. Video Refresh Parameter Block 


HORIZONTAL SCAN LINE TIME 
HORIZONTAL SYNC END 
HORIZONTAL BLANK END 
HORIZONTAL BLANK START 
VERTICAL FRAME TIME 
VERTICAL SYNC END 
VERTICAL BLANK END 
VERTICAL BLANK START 
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FIGURE 13. Video Control Register 


Architectural Description (Continue) 


VCR Video Control Register (16 bits), composed of 
the following fields: 
VRX Video Register Index (3 bits) 
SE Scan Enable (1 bit) 
SM Master Sync (1 bit) 
Sl Interlaced Mode (1 bit) 
SC Composite Sync Mode (1 bit) 
RM Video Refresh Mode (2 bits) 
RAI Video Refresh Address Increment 
(2 bits) 
VBL The Vertical Blanking Flag 
ODD __ The Odd Video Field Flag 
DBB Display Buffer Starting Address (24 bits) 


DBWRP Display Buffer Warp (16 bits) 


THE GRAPHICS ENVIRONMENT 


This section discusses the conventions adopted by the RGP 
and the resultant environment within which all graphics op- 
erations take place. 


Memory Addressing — 


For operations, such as instruction and operand fetching, 
stack operations and interrupt service, the RGP accesses 
memory as a uniform space of 224 16-bit words, starting at 
address 0 and extending to address FFFFFF hex. Bytes are 
not directly addressable by the RGP. 


During drawing operations, the RGP can access individual 
bits in memory by means of a 28-bit quantity called a bit 
address. The correspondence between a bit address and 
the physical bit in memory is as follows: the 24 most signifi- 
cant bits of the bit address provide the address of the word 
in memory containing the addressed bit, while the four least- 
significant bits of the bit address select a bit within that 
word. If the four LSBs are zero, bit zero (i.e., the memory bit 
corresponding to the ADO pin of the RGP) is addressed. 
Thus, the bit-addressed memory starts at Bit 0 of Word 0 
and extends linearly to Bit 15 of Word 224 — 1. 


In practice, the RGP does not read and write individual 
memory bits directly. When it generates a 28-bit bit address, 
the most-significant 24 bits are used to access a 16-bit word 


in the linear address space, while the 4 least significant bits: 


go to the BPU as BO-3. The BPU contains hardware to read 


‘org’ + 1 


BIT ADDRESS "9" 


‘org’ + 2 


and write exactly one bit in the current word. In color sys- 
tems with one BPU per plane, all BPUs operate in parallel, 
reading and/or writing the corresponding bit in each plane 
(that is, all the bits of a pixel). As a result, the 28-bit quantity 
can be viewed as a pixel address as well as a bit address, 
though this interpretation is system hardware dependent 
and may in fact be operation-dependent. This is discussed 
more fully in the Multiple-Bit Pixels Section. 


Cartesian Drawing Space 


Drawing operations performed by the RGP execute in a logi- 
cal drawing space which is Cartesian. This x-y space is de- 
fined as having the origin in the upper left. Movement to the 
right increases the x-coordinate; movement downward in- 
creases the y-coordinate. Each axis may be a maximum of 
216 bits in length. Therefore, each coordinate may take on 
any integer value in the range of 0 to 216 — 1. Note that 
while the coordinate space can range from 0 to 216 — 1, 
vector drawing instructions (such as DRLN and DRPLN) 
have the requirement that a vector length not exceed 214 — 
1 pixels. 


A flexible mechanism controls the correspondence between 
the logical (x-y) location and the physical (bit) address in 
memory. Once the correspondence has been initialized by 
the user, the RGP maintains it throughout all drawing opera- 
tions. The correspondence can be changed explicitly at any 
time. . 


The correspondence is shown in Figure 14. The Cartesian 
origin (x= y= 0) corresponds to the hypothetical bit address 
org. Incrementing the logical x-coordinate increments the 
bit address. Incrementing the y-coordinate adds. the con- 
tents of the DSWRP register (the warp of the drawing 
space) to the bit address. The current bit address is main- 
tained in the DSAD register. Thus, 


DASD = org + DSY x DSWRP + DSX 


The correspondence between Cartesian coordinates and bit 
addresses is established by explicitly loading the registers 
DSX, DSY and DSAD. This correspondence is maintained 
by the RGP during the execution of all instructions if 
DSWRP contains a value appropriate for the current draw- 
ing space. Note that the drawing space can coincide with or 
overlap the display buffer or can be entirely distinct from it, 
and the two spaces can have separate warps. 


19<_ ‘org’ + DSWRP 
26<— 'org' + * DSWRP 


9 (Xi, Yj) 


‘org’ + Yj* DSWRP# Xi 


Note: ‘org’ is the hypothetical bit address 
corresponding to the Cartesian origin. 


*yY 
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FIGURE 14. Logical and Physical Addresses 
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Architectural Description (Continuea) 


Dot/Line/Polyline/Polygon Drawing — 
Operations 


During the execution of the RGP’s drawing instructions, the 
flow of data is between the memory and its associated BPU. 
The RGP’s drawing operations can be divided into two 
classes, based upon the operating mode of the BPU: 
BITBLT or LINE, according to the state of the L/B line of the 


-RGP and BPU(s). LINE mode is used for all pointwise draw- 


ing operations; BITBLT mode is used for all wordwise draw- 
ing operations. 


Pointwise drawing operations include reading and writing in- 
dividual bits (RDPT, DRPT), drawing lines (DRLN), drawing 
polylines (DRPLN) and drawing polygons (DRPGN). During 
all pointwise operations, the line pattern generator is active. 
The clipper (if enabled), is also active. 


The clipper performs its function by suppressing the memo- 
ry cycle associated with drawing the current point if that 
point is outside the clipper. : 


The line pattern generator operates in a continuous fashion 
within and between these instructions; it is not forced to a 
given state at the beginning of a pointwise drawing instruc- 
tion. Therefore, a line pattern will continue around the verti- 
ces of a polyline. Also, a curve drawn with a series of DRPT 
instructions will be subject to the line pattern generator. 
Since the line pattern generator cannot be disabled, it must 
be loaded with a solid (all ones) pattern if a solid line (no 
pattern) is desired. Similarly, a pattern of all zeros will Bu 
press all pointwise drawing. 


The RDPT instruction is not affected bi the clipper or line 
pattern generator, nor does it affect them. 


Se BSWRP a 


Source Bitmap 
TL/F/9427-16 


BSAD — 28 bits, bit address 
-— BITBLT Source Address 


BSWRP — 24 bits, word count 
— BITBLT Source Warp 


DSAD — 28 bits, bit address 
— Drawing Space Address 


DSWRP— 24 bits, word count 
— Drawing Space Warp 


BITBLT Operations 


Bit Boundary Block Transfer (BITBLT) operations are car- 
ried out by the RGP with the BPU(s) in BITBLT mode. 
BITBLT is the performance of sixteen specific bitwise logical 
operations between two rectangular arrays of bits, each 
having the same height and width and an arbitrary bit align- 
ment (see Figure 15). In RGP-based color graphics sys- - 
tems, each DP8511 BPU participating in the BITBLT opera- 
tion can be programmed to perform its logical operation in- 
dependently of the others. This permits such effects as 
transparency and foreground-background color rendering. 


During BITBLT, the RGP produces all memory (word) ad- 
dresses and the set of BPU control signals necessary to 
control the FIFO and masking functions. The BPU is respon- 
sible for assembling and shifting source words, storing them 
in the FIFO, receiving destination words, logically combining 
them with the corresponding shifted source words, (masking 
any destination bits necessary) and returning the result to 
the destination. 


As during pointwise operations, the clipper is active and 
clips the destination to pixel resolution. In the process, the 
BITBLT left and right masks may change from those pre- 
dicted by the destination alignment alone. Thus, the mini- 
mum number of memory cycles required, considering the 
clipper, is always performed. 

The two directions of execution of BITBLTs can be specified 
independently: right to left (versus left to right) and bottom 
to top (versus top to bottom). 


Destination Bitmap 


TL/F/9427—48 
—16 bits, bit count 


BWD BWD(source) = BWD(dest) 
— BITBLT Width BHT(source) = BHt(dest) 
BHT — 16 bits, line count 


— BITBLT Height 


| FIGURE 15. BITBLT Parameter Definitions 


Architectural Description (Continuea) 


The reading of the source rectangle and the reading of the 
destination rectangle can be suppressed by the user inde- 
pendently of one another. This should be done after taking 
into account the requirements of the complete set of logic 
operations being performed by the BPUs in the system. For 
example, if no function other than Function 0 (fill with zeros) 
or Function 15 (fill with ones) is being performed, both 
source and destination reads may be suppressed. If a two- 
operand function, such as XOR, is being performed by any 
of the BPUs, both the source and destination must be read. 
Various intermediate cases may arise. the leftmost and 
rightmost words of the destination are always read, since 
this is necessary in order to use the BITBLT left/right 
masks. 


Each BITBLT rectangle (source and destination) has a start- 
ing address (BSAD and DSAB) and a warp (BSWRP and 
DSWRP); they have a common width and height (BWD and 
BHT). These must be initialized prior to executing the 
BITBLT instructions. Additionally, DSX and DSY must be in 
correspondence with DSAD. The use of independent warps 
in the source and destination spaces permits packing and 
unpacking of bitmaps (e.g., for fonts), providing a better en- 
vironment for memory pool management and generally 
leading to more efficient use of memory. 


Polygon-Fill Operations 


The RGP supports the drawing of pattern-filled vertically- 
convex polygons via the FILLA and FILLT instructions. (See 
the PRM for more information.) The argument of the FILLA 
instruction is a list of vertices in counter-clockwise drawing 
order. The first (and last) vertex of the polygon is the current 
point, DSX, DSY, which must be an uppermost vertex of the 
polyon. The RGP’s filling algorithm traverses the data struc- 
ture, determining the polygon’s intercept points with each 


_TATB register 


iieecsadiiiios! 


TFAD register 


font bitmap: 


horizontal line in the vertical extent of the polygon. Each line 
is filled, using a pattern contained in the BPU’s FIFO. 


The FILLT instruction provides a similar mechanism for fill- 
ing trapezoids with horizontal top and bottom edges. The 
key feature of this instruction is that the lines that form the 
right and left edges of the trapezoid need not start and end 
at the intersections with the top and bottom edges of the 
trapezoid. . 


Each horizontal line used in filling a polygon is treated as a 
BITBLT of one-bit height. The first horizontal line is filled by 
logically combining the first word from the BPU’s FIFO with 
the successive words of the horizontal line. the second line 
is filled, in the same manner, utilizing the next word from the 
BPU’s FIFO, etc. The FIFO read is non-destructive; as a 
result, the pattern repeats vertically every 16 words. Since 
the FIFO is 16 bits wide, the pattern is 16 by 16 bits. 


Text Operations 


Text primitives are handled by the RGP within an environ- 
ment designed to support flexibly and efficiently such fea- 
tures as multiple fonts, multiple type sizes and styles, sub- 
and superscripts and proportional spacing. This environ- 
ment maps cleanly into the logical x-y drawing space and 
does not require dedicated text planes. 


Text is rendered from monochrome or full color storage into 
the drawing space by BITBLT. Therefore, a bitwise logical 
operation takes place between the text and the current in- 
formation in the drawing space. The text instructions deter- 
mine from the current state of the text environment (a set of 
pointers and data structures) the necessary BITBLT param- 
eters, perform the BITBLT (subject to bitwise clipping, if the 
clipper is enabled) and update the state of the text environ- 
ment in preparation for the next text instruction. 


Text Attribute Table: 


char 0 


char 1 


_ pointer_low char n (letter "Y") 
| cmd] pointer_hi 


last_char 
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FIGURE 16. Text Address Calculations 
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Architectural Description (Continue) 
The text environment is maintained in the AP as follows: 

TFAD . Text Font Base Address (A11) 

-TNXC... Next Text Character (A3) 

TCDB Text Character Descriptor Table Base (A10) 
The major data structure used in support of text is the Text 
Character Descriptor Table (Figure 716). This table contains 
a four-word entry for each character in the current font. 
Character codes are interpreted by the text instructions as 
offsets into the table. This gives the RGP access to the 
source bitmap of the character and additional information 
used in rendering the character and updating the text envi- 
ronment. 

The entry consists of: 


POINTER A relative pointer (28 bits) to the. source BHTiep 
key A field specifying the interpretation of the table 
entry: 
—Render as portrait mode 
—Render as landscape mode 
—Trap through INTB + 8 
WD Source bitmap width (8 bits) 
HT ~ Source bitmap height (8 bits) 
Dx X-offset (signed, 8 bits) 
DY Y-offset (signed, 8 bits). 


lf the character | is to be rendered, the source bitmap is tak- 
en from TFAD + POINTER, the logical destination is (DSX 
+ DX, DSY + DY), and the BITBLT height and with | are 
HT and WD. 


The environment will be updated as follows in the portrlt 
mode: 


DSX = DSX + DX + WD 
DSY = DSY 
DSAD = DSAD + DX + WD 


The environment will be updated as follows in the land-— 


scape mode: 
DSX = DSX 
DSY = DSY + HT + DY 
DSAD = DSAD + (HT + DY) * DSWRP 


The subroutine call (trap) option is a general-purpose es- 
cape mechanism for handling EOT characters, line feeds, 
characters larger than 256 by 256 pixels, etc. When the key 


field indicates trap, the RGP does not interpret the other 


fields of the entry, leaving them free for other purposes. — 


Characters can have eight-bit codes or sixteen-bit codes. 
Eight-bit characters can be packed into sixteen-bit words 
and unpacked by means of the DCL or DCH instruction. 


See the PRM for more information. 


Multiple-Bit Pixels : 


The RGP hardware supports niedbed renaedng of graph- 
ics primitives into a drawing space. In the process, it main- 
tains the correspondence between logical and physical 
memory. This correspondence is independent of pixel size 
(depth, number of bits per pixel). In fact, the RGP architec- 
ture has no parameters for pixel size. This system-level pa- 
rameter is effectively a hardware/software layer applied 
above the RGP architecture. This allows designers of 
AGCS-based systems some flexibility i in trading off cost and 
performance. — 


The lowest cost approach is to use a single BPU per sys- 
tem, time-multiplexing it across the bit planes. Each graph- 
ics primitive must be rendered once in each bit plane. This 
can be done by assigning a block of memory addresses to 
each bit plane, and considering each bit plane as an inde- 
pendent drawing space. By changing the physical address 
corresponding to the drawing origin (modifying DSAD, typi- 
cally by means of the SETPT instruction) the same display 
list can be executed in each plane. 


The highest performance solution is obtained by using one 
BPU for each plane. The display list is executed once, with 
all planes being updated simultaneously. One way to imple- 
ment this is for each plane to have a unique block of ad- 
dresses where the corresponding words in each plane have 
addresses whose m least-signficant bits match, where 2™ is 
greater than or equal to the size of the plane. Then the 
higher-order address bits (above m — 1) can be decoded to 
select the plane, and external logic can address the planes 
individually or in parallel (selecting some or all of the planes 
simultaneously). This requires a mechanism to isolate or 
connect, as required, the local data buses of the individual 
planes, typically a TRI-STATE® buffer per plane, plus the 
logic to control the buffers and to distribute memory control 
signals, typically RAS, among the planes. This logic is re- 
ferred to as the video plane control logic. 

An intermediate solution might use a BPU for every other 
plane, multiplexing each BPU two ways. In this case, the 
display list would be executed twice. Other intermediate so- 
lutions are possible as well. 


VIDEO REFRESH 


The RGP’s video refresh logic is designed to support a vari- 
ety of raster graphics applications, including non-video ap- 
plications such as laser printers. This is accomplished by 
means of flexible programming of the video parameters and 
through a section of video refresh modes. 


The user must determine the appropriate video parameters 
for the chosen display device, additionally taking into ac- 
count the chosen values for PCLK and LCLK (the pixel 


Architectural Description (continued) 


and load clocks, respectively). These values are then writ- 
ten to the video parameter biock. 


Additional parameters associated with the video refresh log- 
ic are contained in the RGP’s Video Control Register (VCR). 


Signal Definitions 


The RGP’s video refresh logic is driven by a single signal, 
LCLK. From this clock, according to its operating modes, 
the RGP generates the signals required to directly control a 
video monitor: Horizontal Sync, Vertical Sync and compos- 
ite Blanking. Bus cycles required for video refresh purposes 
are also initiated by this logic. 


Horizontal Sync causes the monitor to initiate the horizontal 
retrace period. Vertical Sync causes the monitor to initiate 
the vertical retrace period. Blanking causes the monitor to 
shut off the video output, in order to prevent writing to the 
screen during retrace periods. This prevents spurious re- 
trace lines from appearing in the display. 


Video Parameter Definitions 


The correspondence between the register-resident parame- 
ters and the video waveforms is depicted in Figure 17. Hori- 
zontal parameters are expressed in units of LCLKs; vertical 
parameters are expressed in units of scan lines. 


QO HSYNCE 


HORIZONTAL SYNC 


HORIZONTAL BLANKING 


VERTICAL SYNC 


VERTICAL BLANKING 


BLANKING 


BLANKING 


The video refresh logic is a pair of counter-driven machines. 
The first, responsible for horizontal sync and blanking, incre- 
ments on each LCLK. Its counting sequence is 0,1, ... 
HSLT,O,1 .... As a result, a scan line (including sync) is 
HSLT + 1 LCLKs in duration. 


The second counter-driven machine is responsible for gen- 
eration of the vertical sync and blanking. It increments once 
per scan line, on the LCLK during which the horizontal coun- 
ter is cleared. Its counting sequence is 0,1,... VFT — 1,0,1 
.... As aresult, a frame (including sync) is VFT scan lines in 
length. 


Refer to the PRM for more information. 


interlaced Refresh 

The video refresh logic may be set up for interlaced scan 
mode by setting the Si-bit of the Video Control Register 
(VCR). In this mode, the screen is refreshed in two fields, 
the even and odd fields. 

The even field consists of all even-numbered lines; the odd 
field consists of all odd-numbered lines. 


interlaced sync is generated by a slight modification to the 
vertical sync as shown in Figure 78. 


HSLT 0 


ACTIVE VIDEO 


BLANKING 


ACTIVE VIDEO 


BLANKING 
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FIGURE 17. Video Parameter Definitions 


EVEN FRAME 


| 1/2 HSLT 


ODD FRAME 
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FIGURE 18. Interlaced Video Timing 
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Architectural Description (Continue) 


External HSYNC 
(From master syne source) 


External VSYNC 


FIGURE 19. External Svicheenbation (Genlock) 


Video Refresh Modes 


The RGP provides four operating modes for the video re- 
fresh logic. The fundamental difference among the various 
modes is the way in which they allocate the available mem- 
ory bandwidth between update (drawing) operations and 
video refresh operations. In this context, ‘drawing opera- 
tions” refers to ALL operations other than video refresh. 
The mode is selected by means of the RM field of the Video 
Control Register (VCR). 


ModeOQ No memory cycles are generated for refresh pur- 
poses. All memory bandwidth is available for 
drawing. 


During active (non-blanked) video time, each 
LCLK causes the RGP to request one bus cycle 
for refresh purposes. The request is indicated by 
the RGP’s assertion of DRREQ (Display Refresh 
Request) output. Drawing operations may take 
place at all other times. 


During active video time a bus cycle is requested 
on each LCLK, alternating between video refresh 
cycles and drawing cycles. During planing: only 
drawing cycles are generated. 


A refresh cycle is requested by the RGP at the 
beginning of horizontal sync, in preparation for 
the next scan line. A refresh cycle will also be 
requested when the refresh address falls on a 
256 word boundary. Drawing cycles can be gen- 
erated at all other times. This mode is intended to 
support Video DRAMs. 


External Sync Sources 


Mode 1 


Mode 2 


Mode 3 


The video refresh can be programmed for synchronization | 


to an external source via the SM-bit of the Video Control 
Register (VCR). When in the Master Sync mode, the RGP 
generates all syncs and blanking signals with ‘nO B extemal 
reference (except LCLK). 


When in Slave Sync mode (see Figure 79), the RGP’s Verti- 


cal Sync line becomes an input and is driven by the vertical 


1-20 


| Register 


Varactor 


Pao | Crystal 


Loop Filter 


> HSYNC Out 
(To monitor) 


VSYNC out a 
TL/F/9427-20 


sync from an external source. The negative transition of this 
input will clear the vertical counter in the RGP’s video re- 
fresh logic. Thus, the RGP is Sorced into vertical ‘sync with 
the outside source, 


A phase-locked loop in the VCG forces the RGP’s Horizon- 
tal Sync output into synchronization with the externally-sup- 
plied horizontal sync by adjusting the frequency of LCLK 


(indirectly; the RGP’s processor clock is actually adjusted 
- directly. LCLK is divided down from the processor clock). 


INSTRUCTION SET 


Addressing Modes 


The Address and Data processors of the RGP are register- 
to-register machines. MOV instructions are included in the 
instruction set to provide a mechanism for transfers be- 
tween registers and memory. MOV instructions can use the 


‘following addressing modes, which are also supported, to 
"varying degrees, by the remainder of the instruction set: 


Immediate ~ The operand is contained in the word(s) of 
memory immediately foloNend the instruc- 


tion. 


The operand is contained in the memory lo- 
cation(s) pointed to by the 24-bit address 
quantity contained in the two memory words 

immediately following the current instruction. 
The least significant word of the address is 
stored at the lowest address. 


The operand is contained in the memory lo- 
cation(s) pointed to by the sum of the con- 
tents of the indicated AP register and the 
twos complement displacement quantity 
contained in the word immediately following 
the current instruction. 


The operand is contained at the memory lo- 
cation(s) pointed to by the contents of the 
indicated AP register, subject to the following 
convention: The AP register is incremented, 
by the number of words in the operand, after 
performing the operation. 


Direct 


Relative 


Register 
Indirect with 
Post- 
Increment 


Architectural Description (Continued) 


Register The operand is contained at the memory 

Indirect with location(s) pointed to by the contents of 

Pre-Decrement the indicated AP register subject to the 
following convention: The AP register is 
decremented, by the number of words in 
the operand, prior to performing the oper- 
ation. 


instruction Set Summary 


The instruction set of the RGP can be divided into the fol- 
lowing categories: 


Load and Instructions that provide transfers between 
Store registers and memory. 

Instructions 

General Register-to-register operations that take 
Processing _ place in either the AP or DP. 

Instructions 

Program Instructions that control the flow of programs 
Control and the general software environment. 
Instructions 

Graphics Instructions that draw or that control drawing 
Instructions parameters only. 


Load and Store Instructions 


MOV _Load register from memory or store from register- 
into memory. 


General Processing Instructions 


The General Processing instructions are those that execute 
within either the AP or the DP, or from DP to the U Bank or 
vice versa. They utilize register-to-register addressing only. 
All of these instructions will execute in the DP; some will 
execute in the AP as well. Both operands must belong to 
the same processor, except for those that act on registers 
from the DP and U Bank. 


ADD Add (AP or DP) 
ADDC Add with carry 

AND Bitwise And 

CMP Compare 

DEC Decrement (AP or DP) 
INC Increment (AP or DP) 
MOV Move within DP 


Move within AP 
Move between DP and U Bank 
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EX Exchange registers within AP 
Exchange registers within DP 
Exchange registers between DP and U-bank 


EXLN Exchange six registers used in line drawing with 
alternate register block 

MULS Multiply signed 

MULU Multiply unsigned 

NOT Bitwise Complement 

OR Bitwise Or 

ROLC Rotate Left with Carry 

RORC Rotate Right with Carry 

SHL Shift Left 

SHRA _ Shift Right, Arithmetic 

SHRL Shift Right, Logical (AP or DP) 

SUB Subtract (AP or DP) 

SUBC —_ Subtract with Carry 

TEST Test with Mask 

XOR Bitwise Exclusive-Or 


Program Control Instructions 


The Program Control instructions affect the program flow by 
causing non-sequential instruction execution or by suspend- 
ing processing. A number of these instructions execute con- 
ditionally; these are indicated by the ce field in their mne- 
monics, which is understood to mean one of the following 
interpretations of the condition code bits of the PSR: 


Not_Zero 
No__Carry 
No__oVerflow 
Not__Higher 
Not_Negative 
Less or Equal 


Zero 

Carry 
oVerflow 
Higher 
Negative 
Greater Than 
Less Than 
ApZero 
ApCarry 
Within 


*(K = 1 implies “character available in TNXC’”’) 


Tt(NK=1 implies “NOT” character available in TNXC) 


Greater or Equal 
Not_ApZero 
No__ApCarry 
NotWithin 
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ACKVI Acknowledge VIDEO. untespt (clear VIP) 
BRKn Break (trap) n 

Bec — Branch conditionally 

BR _ Branch unconditionally 


CALLcc Cail subroutine conditionally 
CALL Call subroutine unconditionally 
HALT Halt instruction execution _ 


INITB —_Initalize BPU(s), i.e., force RSTO low for two clock 
periods 


NOP No operation 

RET Return from subroutine unconditionally 
RETcc Return from subroutine conditionally 
RETI Return from interrupt 


Graphics Instructions 


The Graphics instructions include two classes: those that 
actually draw, that is, those by means of which the RGP 
modifies memory in the drawing space by means of the 
BPU(s), and those that do setup in preparation for drawing. 
The setup instructions are considered first in the following 
listing. 


The RGP maintains the concept of current point, analogous 
to the current location of the pen on a conventional plotter. 
Certain graphics instructions can be performed in one of 
two modes: relative to the current point (relative) and rela- 
tive to the origin (absolute). Instructions that can be per- 
formed in either mode are shown below in both mnemonic 
forms..The A suffix indicates absolute mode. 


SETPT | Set drawing point, given DSAD, - 

SETPTA DSWRP, x and y 

SETPTS. Set source point, given BSAD, BSWRP, 

a x and y 
SETLN _ Calculate and retain parameters for 
SETLNA Bresenham line-drawing algorithm, but 
do not draw fine 

BT ulsd Perform BITBLT of size BWD and BHT, 

u = {UD} from source at BSAD with warp 
= {LR} BSWRP, to destination at DSAD with 

s = {MB} _ warp DSWRP. u and / control BITBLT 

d= {CW} direction. s and d control reading 

_ source and destination. 

DRPT Draw point at DSAD 

RDPT Read Point 

DRLN Draw line 

DRLNA 


1-22 


DRLNS | Draw line using previously-calculated 


parameters (see SEEN SEEN) 

DRPLN ; Draw polyline 

DRPLNA © 

DRPGN Draw polygon » 

DRPGNA . | 

FILLAd Fill polygon. d controls reading the des- 
tination during the fill. 

FILLTd ' Fill trapezoid, using previously estab- 
lished register values. od controls read- 

7 ing the destination during the fill. 

DCHu/ld Draw character high. u, / and d control 
BITBLT direction and reading the desti- 
nation 

DCLu/d Draw character low. u, / and d control 
BITBLT direction and reading the desti- 
nation 

DCNu/d Draw character, next. u, / and d control 


BITBLT direction and reading the desti- 
nation 


DCWuld [An+ +] Draw character, word. u, /and d control 
BITBLT direction and reading the desti- 
nation 


INITIAL OPERATION 


When the RGP is placed into the reset state jess AGTH, 
several registers revert to known states, and instruction ex- 
ecution begins. The following registers are initialized: 


PC Points to location 0 
PSR Bits VIE, EIE, PTE and CLE are zero 
VCR Fields VRX, SE, SM, SI, SC, RM and RAI are zero 


Since interrupts cannot be processed properly before cer- 
tain initialization has been performed, all interrupts are dis- 
abled when the RGP is placed into the reset state. Maska- 
ble interrupts must be enabled explicitly; while the non- 
maskable interrupt becomes enabled as soon as a new val- 
ue is stored into PSR. 


The RGP begins execution by alternately fetching and exe- 
cuting instructions starting at address 0. Typically, this initial 
code is responsible for establishing base pointers for data 
and stack areas and generally establishing the software en- 
vironment, as with any microprocessor. Next; necessary 
data structures and pointers are initialized to support the 
graphics environment. Any peripherals that reside in the 
RGP’s memory space can be initialized at this point. Finally, 
the internal video refresh controller is programmed in a 
manner consistent with the CRT monitor being used (if any) 
and is enabled. 


Architectural Description (Continued) 
Operation beyond this point is highly implementation-depen- 
dent. In a workstation application, the RGP might execute a 
communications protocol with another processor upstream 
in the graphics pipeline, awaiting the arrival of a display list 
to be executed. Upon receipt, the RGP would directly exe- 
cute or interpret the display list, rasterizing graphics primi- 
tives into the display buffer. Upon executing the final display 
list instruction, the RGP would typically signal completion, 
thus completing the protocol with the upstream processor 
and allowing the process to continue. 


Alternatively, in a standalone application like a graphics ter- 
minal, the RGP might enter a control program, servicing pe- 
ripherals and executing a command interpreter. Here, the 
RGP would be responsible for keyboard, mouse and UART 
service; at the same time, the RGP might execute a graph- 
ics language interpreter, responding to remote host com- 
mands by maintaining a graphics environment and drawing 
into the display buffer. 


SYSTEM-LEVEL IMPLEMENTATION OF THE DP8500 
FROM A HARDWARE PERSPECTIVE 


In order to maintain high efficiency in drawing operations, 
memory access overhead must be minimized. One of the 
most efficient and economical techniques for reducing 
memory access overhead is through use of the pagemode 
memory access principle. Here, a memory page physically 
maps to a selected DRAM row, or page. Design of the RGP 
was made with the pagemode principle in mind. All BITBLT 
operations take place in a burst format which directly sup- 
ports pagemode. This process is realized by a sixteen word 
deep FIFO located in the core of the BPU. During the 
BITBLT process, the FIFO serves as a temporary storage 
buffer for storing one operand of the BITBLT. As drawing 
operations take place, the RGP asserts a pagebreak detect 
mechanism which serves as a system end of page detector. 
It is through system use of this system-level indicator that 
efficient transitions from memory page to page come about. 


In addition to the various access techniques for retrieving 
memory data during drawing operations, a BITBLT opera- 
tion must deal with the task of calculating relative bit offsets 
between the source and destination word locations. In situa- 
tions where the relative bit offset between the source and 
destination is zero, the number of source reads will always 
match the number of destination writes. BITBLT operations 
of this nature are known as ‘linear.’ However, ‘non-linear’ 
situations do exist such as the generation of an extra source 
read to one destination write. The converse of this is also 
true: where one source read is accompanied by two destina- 
tion writes. A set of rules describes the RGP principle of 
BITBLT: 

1. Only the minimum required number of destination writes, 
or destination read-modify-writes will be generated in or- 
der to perform a given BITBLT. 

2. Read-modify-write operations will always be generated 

-on the left and right BITBLT edge boundaries. 

. The appropriate masking function(s) are always applied 

on BITBLT edge boundaries. 

4. In situations where the width of the BITBLT is 16 bits or 
less, a read-modify-write occurs, and both the Left and 
Right Masks will be applied. 

A number of parameters exist in the BITBLT (BT) and FILL 

instructions. These parameters instruct the RGP on which 


GO 
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bus cycles to produce. In addition they determine the gener- 
ation of the FIFO Read and Write strobes. In BITBLT opera- 


-tions which require BITBLT operand source read(s), the 


RGP enters the BITBLT drawing “source” mode. Here, 
source data is fetched from memory in a “burst” fashion, 
shifted (if necessary), and stacked in the FIFO as the FWR 
input is strobed. Stacking continues until either the FIFO 
fills, or the width of the BITBLT is reached. At such time, the 
RGP then enters the drawing ‘destination’ mode. For the 
majority of bus operations, the RGP remains in the “‘destina- 
tion’ mode, only entering the “source” mode when source 
drawing is specified as part of the BITBLT instruction. 


The DP8511 BPU is designed to operate on the source, 
destination or combinations of both BITBLT operands as 
specified in the RGP generated bus cycles, and the function 
code placed in the FSE register. A typical example of an 
one-operand BITBLT is a direct drawing write-only opera- 
tion. Here, the operand is sourced from the destination loca- 
tion, passed through the BPU and written back to the same 
location. 


Two-operand BITBLTs involve reading the destination 
source operand into the BPU during the read portion of the 
RMW cycle. When the ‘modify’ portion is reached, the 
FIFO output produces the second operand. Both operands 
are then logically combined in the BPU’s Logic Unit as dic- 
tated by the function code in the FSE register. The opera- 
tion is made complete by the write back to memory. In short, 
it is the value placed in FSE and the type of bus cycle gen- 
erated which determine the number of operands used in a 
BITBLT. It should be understood that the purpose of a FRD 
strobe is to produce a fresh second source operand. If FRD 
is not strobed, the FIFO output will reflect the last FRD 
strobed output. The methodology for producing FRD is de- 
scribed as follows. 


In many situations where a polygon fill is taking place, the 
FIFO output contents are only updated after the width of the 
BITBLT is reached. This results in a replicating pattern 
which matches the contents of the 16 x 16 FIFO. 


Because the FIFO read and write (FRD and FWR) strobes 

play such a vital role in storing and retrieving BITBLT oper- 

ands, generation of FRD and FWR are based on the follow- 

ing rules: 

1. FWR is always generated whenever a BITBLT source 
read cycle is generated, 


2. If a BITBLT source read occurred, then FRD will automat- 
ically be generated for either a destination read-modify- 
write or direct memory write only operations, 


3. If the BITBLT source read cycle is suppressed, no FRDs 
will be generated for the destination portion of the 
BITBLT cycle. 


4. The FIFO output will always reflect the current value of 
the FRD pointer. 


See the Programmers Reference Manual for details. 


BUS OPERATION 


This section describes the various bus cycles generated by 
the RGP. For clarity in documentation, Bus Status refers to 
the RGP generated Bus Status lines (BS1-0). Read and 
Write refer to the RGP generated Read (RD) and Write 
(WR). It is to be understood that RD and WR serve as status 
only. 
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Architectural Description (Continued) 


SYSTEM CONTROL MECHANISM 


The key elements in designing with the AGCS are the RGP, 
BPU and Bus State Machine (BSM). As each element per- 
forms a specific task, the net result is a tightly coupled func- 
tional hardware array. The primary function of the RGP is to 
serve as an efficient rendering engine: generating a global 
address and system status. The task of the BPU is to serve 
as a data path channel. 


With current technology, the system designer has a wide 


variety of system memory access techniques to chose from. 
Optimazation comes about only if the system designer fol- 


lows all the special rules which govern each memory access - 


requirement. As technology improves, the rules change. 
The majority of microprocessor products on the mark today 
contain an embedded memory access protocol. Although 
the initial system hardware implementation may be some- 
what more straight forward, the result often ends in a prod- 
uct with restricted, or inefficient access protocols. Fortu- 
nately, the underlying philosophy of the RGP is to allow the 
system designer the freedom to optimize specific memory 
access protocols for the application at hand. 


In order to ensure the most efficient access protocol, opti- 
mazation must be carried out by the system itself. By using 
the RGP as a system status generator, status information is 
conveyed to an external Bus State Machine (BSM) which 
carries out the timing and system control for a given bus 
status. In effect, as status is generated, the BSM becomes a 
hardware shell, which gives the RGP a unique system per- 
sonality. 


As each system is developed, a unique personality emerges 
which best suits the system from the standpoint of improve- 
ments in technology, and access efficiency. 


The purpose of this section is to render a process whereby 
a system designer can understand the underlying criteria for 
designing the BSM. First and foremost in the design pro- 
cess, the designer must understand the functional role of 
the BSM in a system. Only when the demands of a particu- 
lar system design are clearly understood is. it possible to 
tailor the BSM to optimize memory access efficiently. It 
should be understood that different system designs will 
have different criteria. This section will cover all of the basic 
elements for a BSM design. 


BSM DESIGN METHODOLOGY 


In designing the BSM, the first stage the system designer 
must go through is to gain a thorough understanding of the 
various relationships, or roles the BSM must play in order to 
function with various members of the system. This proce- 
dure is best understood by concentrating on the following 
core issues: 

1. The role of the BSM to the RPG, and visa-versa. 

2. The role of the BSM to the HOST. | 

3. The role of the BSM to a memory refresh request. 

4. The role of the BSM to the BPU. 7 

5. And finally, the role of the BSM to the system memory. 
On system reset, the BSM is expected to enter an initializa- 
tion loop. The purpose of this loop is to allow the BSM to 
enter a known state, and to synchronize itself to the system. 
Once synchronized, the BSM begins to sample the Bus 
Status and ALE. From a system perspective, the purpose of 
ALE is to demultiplex the address and data lines. From a 
BSM perspective, ALE serves as an indication that a bus 
cycle has just begun. In effect, the BSM uses ALE to syn- 


chronize itself to the RGP. Because the BSM understands 
the exact nature of the RGP and system bus latency associ- 
ated with each bus cycle, it knows when to assert WAIT (if 
necessary) and when to place data on the system global 
data bus if the RGP is doing a Read or Write. The BSM must 
generate feedback in the form of WAIT inorder to ensure 
that the RGP remains synchronized to the system. 

Various systems also require that a HOST gain mastership 
of the bus. Here, the same rules apply as stated above. A 
bus cycle start is required (often this is generated by the 
assertion of the bus acknowledge signal), and WAIT is ap- 
plied until the data transfer is complete. 

Accessing the system bus can take place in a variety of 
forms. Often the HOST forces the RGP completely off the 
bus via the HOLD/HOLDA mechanism, other times a cycle 
stealing approach is taken. Cycle stealing allows the HOST 
to temporarily gain access to the system bus via a direct 
arbitration mechanism built into the BSM, in effect, directly 
bypassing the RGP. If a cycle stealing approach is taken the 
BSM must delay or WAIT any pending RGP requests during 
the HOST transfer. | 

One of the main purposes of the BSM is to arbitrate for 
various incoming processes at the system-level. Not only 
must it arbitrate for mastership, but it may also need to arbi- 
trate for memory refresh requests. Often, the RAS and CAS 
functions are included as part of the.BSM therefore, a proto- 
col must be established between the memory refresh pre- 
empting device and the BSM. Upon assertion of a memory 
refresh request, the BSM is required to arbitrate itself for the 
refresh cycle, WAIT the Bus Master (should a bus cycle be 
pending) and perform the necessary refresh. ? 


During BPU setup and drawing, the BSM is required to apply 
various strobes to the BPU. One of the main purposes of the 


_ functional timing section is to illustrate when the system 
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strobes are asserted. For non-drawing operations, the BPU 
Control Register Enable (CRE) and Function Select Enable 
(FSE) strobes are asserted. For drawing operations, Data 
Latch Enable (DLE), Pixel Data Latch Enable (PDLE), Data 
Output Enable (DOE) and Pixel Data Output Enable (POE) 
strobes are asserted. 


From a system-level perspective, the BSM is required to 

apply the following system strobes in order to execute the 

various system functions: 

Non-drawing functions (RGP and HOST); 

1. Arbitrate and execute system control Video Refresh func- 
_ tions, 

2. arbitrate and produce system control Instruction, Oper- 
and fetch and write access functions. 

Drawing functions(RGP): 

3. Arbitrate and execute system control drawing source 
read, and destination write operations, 

4. Arbitrate and execute system control drawing destination 
read-modify-write operations. 


Once the basic system arbitration tasks of the BSM are un- 
derstood, the next step is to develop the overall structure in 
such a way that the access priority mechanism of both the 
BSM and RGP (see the section on RGP BUS PRIORITY 
MECHANISM) correlate. Figure 20 illustrates the priority 
mechanism of the BSM: with Display Refresh being the 
highest priority and drawing being the lowest. This type of 
BSM structure guarantees that the BSM will execute high 
priority RGP Video Refresh requests in a timely fashion 
without unexpected latency or hindrance from other devic- 
es. 


Architectural Description (continued) 
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FIGURE 20. Flow Diagram of the Bus State Machine 
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Once an understanding of the role, and arbitration of the 
BSM is worked out, a study begins on the various loop, or 
branch requirements necessary for the BSM to execute a 
given function. It is within each the loop that the various 
memory access techniques are employed. The number of 
states required in executing the loop itself is often reflected 
in the latency associated with each type of memory access. 


As the design progresses, the designer gains a deeper in- 
sight as to how the various memory access begin to inter- 
act, and the functional role of the RGP Page-Break mecha- 
nism in memory burst operations. Interaction of access cy- 
cles are best understood by understanding the sequence of 
RGP drawing bus cycles produced, and the effects on the 
system memory. 


Figure 20 gives the system designer a conceptual feel for 
both the structure and basic system functions required of 
the BSM. Of special interest are the Display Refresh and 
Drawing loops. In applications which require real-time dis- 
play (or screen) refresh, data must be downloaded to the 
video pipe in such a way as to provide a contiguous stream 
of valid data to the image device to avoid image folding or 
tearing. 


The methodology of producing real-time display refresh is 


‘known as Mid-scan line load. Including Mid-scan line load 


as part of the BSM requires a nested inner synchronization 
loop as part of the main Display Refresh loop. Under control 
of the RGP DRREQ and an external counter, the inner loop 
of the BSM serves to synchronize the BSM to the data con- 
tents of the active video pipe. After synchronization, down- 
loading can take place at exactly the prescribed time. Detail 
of this operation is covered in the application note, Mid- 
Scan-Line-Load Techniques using the DP8500 Raster 
Graphics Processor. 


Recently, VRAM technology has introduced a new tech- 
nique for loading the serial shift registers called Split Regis- 
ter Load. For systems employing such devices, direct Data 
Transfer operations can take place without the necessity for 
BSM system synchronization. 


In summary, the BSM understands the system. It is respon- 
sible for receiving instruction from the bus master, and exe- 
cuting the control commands from the system-level. As 
technology changes, so to does the BSM. As a result the 
RGP allows the system designer to employ any access 
technique which best serves to optimize the design. 


Included in the bus cycle section are a number of functional 
timing diagrams. The role of the functional timing diagram is 
to express, at the conceptual timing diagram level, the BSM 
placement of system control strobes for the required bus 
cycles. 


OVERVIEW OF THE RGP BUS CYCLES 


RGP Bus Operations are classified as either drawing or non- 
drawing and are based on the RGP Read, Write and Bus 
Status lines. Non-drawing bus cycles are composed of a 
minimum of three clock cycles. These consist of operand 
read and write operations which are wholly responsible for 
fetching instructions, maintaining stacks, queues, perform- 
ing screen refresh, plane control functions and loading BPU 
Control and Function Select registers prior to BITBLT. 


Drawing bus operations are composed of a minimum of two 
clock cycles which are allocated into BITBLT and Line draw- 
ing operations. Reserved for working in conjunction with the 
BPU(s) and the frame buffer array, the RGP serves as a bus 
status and address generator which is to source a variety of 


drawing bus cycles such as BITBLT Source read, BITBLT 
Destination write, BITBLT read-modify-write, and Line read- 
modify-write operations. 


Although the BSM is responsible for strobing the data and 
pixel latches, the RGP is responsible for directly strobing 
these particular BPU inputs: 


1. BO/LME—Pixel Address Zero/Left Mask Enable, 
2. B1/RME—Pixel Address One/Right Mask Enable, 
3. B2/FWR—Pixel Address Two/FIFO Write, — 

4, B3/FRD—Pixel Address Three/FIFO Read, 

5. L/B—Line/BITBLT mode, 

6. BSE—BITBLT. Source Enable. 


These lines will be activated either before or interactively 
during the BITBLT to ensure that the frame buffer data is 
properly manipulated. The L/B input is somewhat static in 


the sense that this input is changed a number of clock cy- 


cles prior to actually implementing the drawing function. 


NON-DRAWING BUS CYCLES 
Non-drawing operation are outlined in detail as follows: 
READ TRANSFER MECHANISM 


Read bus cycles transfer data from memory or peripheral 
devices to the RGP. A no-wait, or minimum Read bu cycle is 
composed of three unique “T” state clock cycles of which: 


1. state T1 produces ALE, address and Bus Status, 


2. state T2 and T2 wait (T2(w)) maintains: upper 8-bit ad- 
dress, bus status and samples WAIT, 


3. state T3 maintains: upper 8-bit address, Bus Status and 
samples data on AD15-0. 


During Read bus cycles, the RGP generates the following 
bus status: 


1. BS1 is a logic zero, 
2. BSO is a logic one, 
3. RD is a logic zero, 
4. WR is a logic one. 


lf necessary, Read Bus cycles can be extended on an inte- 
gral clock cycle basis by the assertion of WAIT (via the 
RGP’s WAIT input). Bus cycle extension is based on the 
premise that if WAIT is sampled low, (i.e., a logic zero at the 
PHI 2 falling edge sample window) during the T2 state, the 
RGP will produce a successive T2(w) state: an exact func- 
tional replica of the T2 state. Thus, T2(w) cycles will contin- 
uously be produced until WAIT is sampled high (i.e., a logic 
one). State T3 follows as a natural progression after T2 or 
T2(w). Valid data is sampled during the PHI 2 falling edge 
sample window of T3. 


Figure 27 illustrates a flowchart of the read transfer mecha- 


_ nism while Figure 22 illustrates functional bus information. 
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WRITE TRANSFER MECHANISM 


RGP Write bus cycles transfer data from RGP to memory or 
a peripheral device. A no-wait, or minimum Write operation 
is composed of three unique “T” state clock cycles of 
which: 

1. state T1 produces ALE, address and bus status, 


2. state T2 and T2 wait (T2(w)), which maintains upper 
8-bit address, bus status, asserts data and samples 
WAIT, 


3. state T3 which maintains the upper 8-bit address, bus 
status and drives data valid onto AD15—0. 


Architectural Description (Continued) 
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FIGURE 21. Read Transfer Fiow Chart 


Write bus cycles generate the following bus status: 
1. BS1 is a logic zero, 

2. BSO is a logic one, 

3. RD is a logic one, 

4. WR is a logic zero. 


If necessary, Write bus cycles can be extended on an inte- 
gral clock cycle basis by the assertion of WAIT. Bus cycle 
extension is based on the premise that if WAIT is sampled 
low, (i.e., a logic zero at the PHI 2 falling edge sample win- 
dow) during the T2 state, the RGP will produce a successive 
T2(w) state which is an exact functional replica of the T2 
state. T2(w) cycles will continuously be produced until WAIT 
is sampled high (i.e., a logic one). State T3 follows as a 
natural progression with Address, Bus Status and Data re- 
flecting the preceding T2, or T2(w) state. 


Figure 23 illustrates a flowchart of the Write transfer mecha- 
nism while Figure 24 illustrates functional bus information. 
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SCREEN REFRESH MECHANISM 


Video refresh operations transfer data from the frame buffer 
to the video screen refresh logic. A no-wait, or minimum 
Screen Refresh is composed of three unique ‘'T” state 
clock cycles of which: 


1. state T1 produces ALE, address and bus status, 


2. state T2 and T2 wait (T2(w)) maintains: all addresses, bus 
status and samples WAIT, 


3. state T3 maintains all addresses and bus status. 


Video Refresh bus cycles generate the following status in- 
formation: 


1. BS1 is a logic one, 
2. BSO is a logic one, 
3. RD is a logic zero, 
4. WR is a logic one. 


Video Refresh bus cycles can be extended on an integral 
clock cycle basis by the assertion of WAIT. Extension is 
based on the premise that if WAIT is sampled low, (i.e., a 
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logic zero at the PHI 2 falling edge sample window) during 
the T2 state, the RGP will produce a successive T2(w) 
state: an exact functional replica of the T2 state. Thus, 
T2(w) cycles will continuously be produced until WAIT is 
sampled high (i.e., a logic one). State T3 follows as a natural 
progression, with Address, Bus Status reflecting a logical 
equivalent of the preceding T2, or T2(w) state. 


PREDICTING DISPLAY REFRESH 


Prior to generating a Video Refresh cycle, the RGP will pro- 
duce an output called Display Refresh Request (DRREQ). 
This signal serves as a forerunner to the Video Refresh Cy- 
cle, which could be used as a high priority, or preemption 
mechanism at the time a HOST is granted the system bus. 
Because of the deterministic nature of DRREQ, this signal 
serves a vital role in execution of real-time screen refresh 
operations known as Mid-scan Line Load. Predicting the ac- 
tual start of the Display Refresh cycle is dependent on the 
activity of the system at the time of DRREQ. Figure 26 illus- 
trates a situation where a Display Refresh cycle is forced to 
wait until the completion of a Read bus cycle. In any given 
circumstance, the Read cycle could be any other type of 
RGP generated cycle. Often, a HOST could possibly be 
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Begin Next Cycle 


bus master at the time of DRREQ. There are two key points 
that one must keep in mind when the system requires Mid- 
Scan line load: 


1. The generation of DRREQ happens at constant intervals. 
2. The start of a Display Refresh is undetermined. 


3. The priority mechanism of the BSM is built such that a 
Display Refresh cycle will take place immediately after 
the current bus cycle, regardless of a pending memory 
refresh. 


4. Up to, and including the assertion and negation of the 
memory Data Transfer line, valid data is in the serial data 
pipe. 

Under most circumstances, serial data is transferred out the 

video shift registers by gating the RGP Blank output with the 

VRAM shift register clock (normaily the shift register clock is 

LCLK). Video Pipelining is then accomplished by taking the 

gated output and forcing this through a clocked shift register 

with the same clock that drives the VRAM serial shift regis- 
ters. 


Figure 25 illustrates a flowchart of the Screen Refresh 
transfer mechanism while Figure 26 illustrates functional 
bus information. 


Bus State Machine 
Present Status 


1. Arbitrate for Video Refresh 

2. Decode Address 

3. Assert System Data Transfer Strobe 

4. Assert System Address Strobes: RAS; CAS 


Mid=Scan Inner=Loop 


1. Loop until Video Serial Pipe is empty 


Complete Cycle 


1. Negate WAIT 
2. Negate System Data Transfer 
3. Negate System Address Strobes 


TL/F/9427-54 


FIGURE 25. Display Refresh Transfer Flow Chart 
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FIGURE 26. Functional Display Refresh Cycle Delayed by a Non-Drawing Read 
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Architectural Description (Continued) 


DRAWING BUS CYCLES 
Drawing operation is described in detail as follows: 
BITBLT SOURCE READ MECHANISM 


BITBLT Source Read operations transfer data from a draw- 
ing source address and place it in the FIFO of the BPU. A 
no-wait, or minimum BITBLT source operation is composed 
of two unique “T” state clock cycles of which: 


1. state T1 produces ALE, address and bus status, 


2. state T2 produces address, bus status, FIFO Write (FWR) 
and samples WAIT 


3. while T2 wait, (T2(w)), maintains all addresses, bus status 
and samples WAIT. 


During BITBLT-Source operations the RGP generates the 
following bus status: : 


1. BS1 is a logic one, 
2. BSO is a logic zero, 
3. L/B is a logic zero, 
4. RD is a logic zero, 
5. WR is a logic one. 


BITBLT-Source Bus cycles can be extended on an integral 
clock cycle basis by the assertion of WAIT. Extension is 
based on the premise that if WAIT is sampled low, (i.e., a 
logic zero at the PHI 2 falling edge sample window) during 
the T2 state, the RGP will produce a successive T2(w) 
state. Aside from the FWR generated during T2, T2(w) is an 
exact functional replica of the T2 state. Thus, T2(w) cycles 
will continuously be produced until WAIT is sampled high 
(i.e., a logic one). 

Upon sampling WAIT high, the RGP will generate a FIFO 
write (FWR) two clock cycles later. In most cases, drawing 
bus cycles will occur back to back, therefore, as a result, 
FWR will occur during T2 of the next BITBLT Source cycle. 


DP8500 


Address/Status Generation 


Generate: Bus Status BS1=0 = 10 
Set: RD/WR Status to Read 
Generate: 24= bit Word Address on A23=-0 
Generate BSE 

Generate Page Break 

Assert Address Strobe: ALE 

Set L/B to Blit 


oe Se 


Begin Next Cycle 


Complete Previous Bus Cycle 


1. Assert FIFO Write Strobe (FWR) 


In order for the BPU to maintain the proper internal data 
pipeline, a succession of events must occur of which: 


1. the RGP samples WAIT high during T2, 


2. on the next clock cycle, or as the RGP enters the next 
bus cycle T1 state, the BSM asserts DLE, 


3. on the next clock cycle, (during the next bus cycle T2) the 
RGP, completes the previous bus cycle by strobing FWR. 


BITBLT source read operations require that the valid data 
be maintained long enough on the system data bus to en- 
sure that data latched into the BPU is valid. Often, and for 
this reason, effective pipelining of the system results in a 
clean and efficient approach when dealing with BITBLT 
Source Read operations. : 


Figure 27 illustrates a flowchart of the BITBLT Source Read 
transfer mechanism while Figure 28 illustrates functional 
bus information. 


BITBLT DESTINATION WRITE MECHANISM 


BITBLT Destination write operations, reads data from the 
BPU FIFO and places the data into the specified destination 
address. From an RGP perspective, a no-wait, or minimum 
BITBLT Destination write operation is composed of two 
unique ‘‘T”’ state clock cycles, of which: 


1. state T1 produces ALE, Address Bus Status and FRD, 
2. state T2 produces Address, Bus Status, FRD and sam- 
ples WAIT, 


3. while T2 wait (T2(w)) maintains: Addresses, Bus Status 
and samples WAIT. 


Bus State Machine 


Present Status 


1. Arbitrate for Drawing Source Read 
2. Decode Address 

3. Assert System Address Strobes: RAS; CAS 
4, Assert System Read Strobe 

5. Negate WAIT 


Present BPU Data 


1. Assert Data Latch Enable (DLE) 


Complete Cycle 


1. Latch BPU Data 
2. if (Page Break = True) 
Negate System Address Strobes: RAS, CAS; 
else: 
Toggle CAS; 
3. Negate System Read 


TL/F/9427-56 


FIGURE 27. BLIT Source Read Transfer Flow Chart 
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FIGURE 28. Functional BLIT Source Read Cycle 


1-34 


Architectural Description (Continued) 


During BITBLT-Destinaton write operations the RGP gener- 
ates the following bus status: 


1. BS1 is a logic one, 
2. BSO is a logic zero, 
3. L/B is a logic zero, 
4. RD is a logic one, 

5. WR is a logic zero. 


BITBLT Destination Write Bus cycles can be extended. on 
an integral clock cycle basis by the assertion of WAIT. Ex- 
tension is based on the premise that if WAIT is sampled low, 
(i.e., a logic zero at the PHI 2 falling edge sample window) 
during the T2 state, the RGP will produce a successive 
T2(w) state. Aside from the FRD strobe, T2(w) is an exact 
functional replica of the T2 state. Thus, T2(w) cycles will 
continuously be produced until WAIT is sampled high (i.e., a 
logic one). 


DP8500 


Address/Status Generation 


1. Generate: Bus Status BS1-0 = 10 
. Set: RD/WR Status to Write 

. Generate: 24= bit Word Address on A23=0 
. Assert Address Strobe: ALE 

. Assert LME/RME masking 

. Generate Page Break 

. Assert FIFO READ (FRD) 

. Set L/B to Biit 


ono O1 & Ghd 


Complete Output Transfer 


Begin Next Cycle 


Figure 29 illustrates a flowchart of the BITBLT-Destination- 
Write transfer mechanism while Figure 30 illustrates func- 
tional bus information. 


BITBLT DESTINATION READ-MODIFY-WRITE 
MECHANISM 


At the system memory level, BITBLT Read-Modify-Write 
(RMW) operations are composed of a BITBLT-destination- 
read directly followed by a BITBLT-destination-write. A 
BITBLT RMW can best be described as a three stage pro- 
cess of which: 


1. the read portion of the bus cycle reads destination data 
from memory and latches it in the BPU. 


2. This data is logically combined in the BPU Logic Unit with 
the source contents read from the BPU’s FIFO (the modi- 
fy portion). 


Bus State Machine 


Present Status 


1. Arbitrate for Drawing Write 
2. Decode Address 
3. Assert System Address Strobes: RAS; CAS 
4, Assert System Write Strobe 

5. Assert Data Output Enable (DOE) 

6. Latch Data 


Complete Cycle 


1. Negate WAIT 
2. if (Page Break = True): 
Negate System Address Strobes: RAS, CAS; 
else: 
Toggle CAS; 
3. Negate System Write 


TL/F/9427-58 


FIGURE 29. BLIT Destination Write Transfer Flow Chart 


1-35 


00S8dd 


DP8500 


Architectural Description (Continueg) 


a 72 am 2 
PHI | 


--PH2 

AE mmx 
A23~16 meena 
Co 


CRE ECT ER Re SRE 


RD 
RGP GENERATED 

we OUTPUT 

WR 

aXe CK 

FRD 

ME XK seneva KX 

WAIT 


ADDRESS 

SWS NNN 
WOE BSM GENERATED 
WRITE OUTPUT 

BPU DATA — 

OUTPUT ENABLE \ f 

(DOE) . 
DATA 


| . . TL/F/9427-59 
FIGURE 30. Functional BLIT Destination Write Cycle 
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Architectural Description (Continued) 


3. A destination write cycle completes the bus operation by 
writing the resultant data back to the original destination 
memory location. 


From the RGP perspective, a no-wait, or minimum BITBLT 
RMW cycle is composed of two unique “T” state clock cy- 
cles, of which: 


1. state T1 produces ALE, address and bus status and FRD 
(see below), 


2. state T2 and T2 wait (T2(w)) maintains all addresses, bus 
status and samples WAIT. 


BITBLT RMW operations are composed of the following bus 
status: 


1. BS1 is a logic one, 
2. BSO is a logic zero, 
3. L/B is a logic zero, 


4. LME is automatically generated (a logic one) for left edge 
of BITBLT, | 


5. RME is automatically generated (a logic one) for right 
edge of BITBLT, 


6. RD is a logic zero, 
7. WR is a logic zero. 


A BITBLT RMW Bus cycle can be extended on an integral 
clock cycle basis by the assertion of WAIT. Extension is 
based on the premise that if WAIT is sampled low, {i.e., a 


pPasoo 


Address/Status Generation 


. Generate: Bus Status BS1=0 = 10 

. Set: RD/WR Status to Read modify Write 

. Generate: 24= bit Word Address on A23=0 
. Assert Address Strobe: ALE 


. Assert LME/RME masking 
. Generate Page Break 

. Assert FIFO READ (FRD) 

. Set L/B to Blit 


enous WD 


Complete Output Transfer 


Begin Next Cycle 


logic zero at the PHI 2 falling edge sample window) during 
the T2 state, the RGP will produce a successive T2(w) 
state. Aside from the FRD generated during T2, T2(w) is an 
exact functional replica of the T2 state. Thus, T2(w) cycles 
will continuously be produced until WAIT is sampled high 
(i.e., a logic one). 


_ Figure 37 illustrates a flowchart of the BITBLT RMW trans- 


fer mechanism while Figure 32 illustrates functional bus in- 
formation. 


LINE DRAWING 


Conceptually, line drawing operations are the same as 
BITBLT RMW operations with the exception that only one 
bit of the destination word is altered during each RMW cy- 
cle. This is realized by using the BPU’s Pixel Port contents 
as the “source” data rather than the data from the FIFO 
and selecting the bit position via the BPU’s pixel address 
inputs. Line drawing can best be described as a three stage 
process of which: 


1. destination data word is read from memory to the BPU, 

2. one of sixteen bits is selected (as determined by B3-0) 
which is mixed with the contents of the Pixel port (the 
modify portion). 

3. The write portion completes the bus operation by writing 
the accumulated results back to the destination memory 
location. 


Bus State Machine 


Present Status 


1. Arbitrate for Drawing Write 
2. Decode Address 

3. Assert System Address Strobes: RAS; CAS 
4, Assert System Read Strobe 


Present BPU Data 


1. Assert Data Latch Enable (DLE) 
2. Latch Data 


Complete Cycle 


. Negate WAIT 

. Assert Data Output Enable (DOE) 
. Assert System WriteStrobe 

. If (Page Break = True): 


Negate System Address Strobes: RAS, CAS; 
else: 
Toggle CAS; 

. Negate System Write 


TL/F/9427-60 


FIGURE 31. BLIT Read Modify Write Transfer Flow Chart 


1-37 


00S8dd 


DP8500 


Architectural Description (continued) 


1! 12. ns _ 12 
PHI 


PH? 


aL [—. 


A23-16 ——— eee baieoas eee 
4015-0 
ice 


RD 

WR | — 

a CT aD Cn 
FRD 


LME _— 
RME 
WAIT 
ADDRESS : | 
STROBES | a Siena tee Seiten anna GOES 
MEMORY : 
READ L/S 
De if °°» °°» » 
MEMORY 
WRITE \ / 
BPU DATA 
OUTPUT ENABLE | \ / " 
(DOE) : | | 


DATA 


BUS mnt DATA READ XK DATA WRITE 


FIGURE 32. Functional BLIT Read-Modify-Write Cycle 
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Architectural Description (Continued) 


From the RGP perspective, a no-wait, or minimum BITBLT 
Destination write operation is composed of two unique “T”’ 
state clock cycles, of which: 


1. state T1 produces ALE, address, bus status, 


2. state T2 and T2 wait (T2(w)) maintains all addresses, bus 
status, B3-0 and samples WAIT. 


The RGP’s Line operations generate the following bus 
status: 


1. BS1 is logic one, 
2. BSO is a logic zero, 
3. L/B is a logic one, 


4. B3-0 are generated to select one bit of the destination 
word, 


5. RD is a logic zero, 
6. WR is a logic zero. 


DP8500 


Address/Status Generation 


. Generate: Bus Status BS1-0=10 
. Set: RD/WR Status to Read modify Write 

. Generate: 24= bit Word Address on A23~-0 
Generate: 4—bit pixel Address on B3-0 
Set: L/B to Line mode 

. Assert Address Strobe: ALE 

. Generate Page Break 


Complete Output Transfer 


Begin Next Cycle 


Ft a 


Line drawing operations can be extended on an integral 
clock cycle basis by the assertion of WAIT. Extension is 
based on the premise that if WAIT is sampled low, (i.e., a 
logic zero at the PHI 2 falling edge sample window) during 
the T2 state, the RGP will produce a successive T2(w) 
state. T2(w) is an exact functional replica of the T2 state. 
Thus, T2(w) cycles will continuously be produced until WAIT 
is sampled high (i.e., a logic one). 

Figure 33 illustrates a flowchart of the Line drawing mecha- 
nism while Figure 34 illustrates functional bus information. 
Not included as part of the diagram is the generation L/B. 
The reason for this is because L/B is normally set to line 
mode a number of clock cycles prior to executing the actual 
drawing function. 


Bus State Machine 


Present Status 


1. Arbitrate for Drawing Write 
2. Decode Address 
3. Assert System Address Strobes: RAS; CAS 
4. Assert System Read Strobe 


Present BPU Data 


1. Assert Data Latch Enable (DLE) 
2. Latch Data 


Complete Cycle 


. Negate WAIT 
. Assert Data Output Enable (DOE) 
. Assert System WriteStrobe 
. If (Page Break = True): 
Negate System Address Strobes: RAS, CAS; 
else: 
Toggle CAS; 
. Negate System Write 
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FIGURE 33. Line Drawing Transfer Flow Chart 
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FIGURE 34. Functional Line Read-Modify-Write Cycle 
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Architectural Description (continued) 


SYSTEM PIPELINE CONSIDERATIONS 


In many high speed system applications, pipelining serves a 
crucial role in increasing the overall setup time required to 
perform a given system operation. Synchronous pipelining 
works on the principle of deliberately delaying various proc- 
essor outputs multiples of the system clock. In effect, the 
system views a delayed copy of the original Bus Master bus 
cycle. It is important to understand that a pipelined system 
doesn’t give something for nothing, and for this reason it is 
important to ensure that the amount of time the system 
processor spends (based on clock cycles) on the bus is 
equivalent to the time the system spends executing the bus 
cycle. Systems based on having an external BSM lends well 
to effective pipelining. 

When properly executed, pipelining gives the system extra 
time for decoding addresses, detecting possible changes in 
address, and allows much tighter control over critical output 
signals. 

Aside from providing a tighter control of time critical signals, 
RGP drawing bus cycles can benefit significantly in a pipe- 
lined system. In particular, the inherent nature of the pipe- 
lined BITBLT Source Read bus cycle can be optimized in a 
pipelined system such that with a one phase clock delay, 
the BSM assertion of DLE can easily be met during the next 
bus cycle T1. Please refer to Figure 28. 


The signals best suited for pipelining are: 

1. BSE, 

2. L/B, 

3. BO/LME, 

4.B1/RME, 

5. B2/FWR, 

6. B3/FRD. 

A one clock pipelined delay is sufficient for optimum opera- 
tion. 

INTERRUPT OPERATION 


The RGP provides to the user two hardware interrupt inputs: 
a Non-Maskable Interrupt (NMI) and an Interrupt (INT). NMI 
is sampled on a negative edge transition, while the INT input 
functions as a level detect. 


The nature of the INT input is such that it should be assert- 
ed as a system status, and maintained true until the RGP 
clears the Interrupt generating device. Because INT is sam- 
pled internally, proper setup and hold time relative to the 
falling edge of phase two guarantee that INT will be sam- 
pled in accordance with the timing specifications. 


Assertion of NMI and INT can happen asynchronously to — 


the RGP phase clock without causing any internal metasta- 
bility problems. However, in the event that INT violates set- 
up, recognition will not be guaranteed to happen until the 
next clock cycle. 
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Unlike the RGP HOLD and HOLDA mechanism where the 
acknowledgement latency is based on a ‘per-bus-cycle”’ ba- 
sis, acknowledgement of either NMI or INT takes place after 
the current instruction is complete. Consequently, Interrupt 
latency will exist prior to the RGP servicing the Interrupt 
Handling routine. 


HALT OPERATION 


Included as an RGP output is HALT. Depending on the sys- 
tem implementation, HALT can serve as a hardware or soft- 


~ ware handshaking mechanism. As an example HALT could 


be used for an ‘end of display-list processing” indicator. 
HALT is often used as a convenient mechanism for imple- 
menting double-buffering schemes. 


Whenever the HALT instruction is executed, the RGP will 
enter an internal HALT state, and the HALT output will be 
asserted. In this state, the RGP’s bus will remain driven, yet 
quiescent. However, some bus activity, in the form of Video 
refresh cycles will continue to be produced if that function 
has been enabled. During the HALT state, a HOST can as- 
sert HOLD, with the RGP immediately relinquishing the bus 
by providing the handshake response HOLDA (see BUS 
ARBITRATION). 


The RGP will continue to remain in the HALT state until 
either an NMI or INT is asserted. At that time the RGP will 
resume normal bus operation by branching to the appropri- 
ate interrupt service routine. 


RESET OPERATION 


Driven by a clean, external logical transition, the RGP Reset 
in (RSTI) forces the RGP to respond in the following man- 
ner: 


1. Forces the Program Counter to return to address zero. 
2. The Video Attribute Register is cleared. 


3. RSTO is asserted, and will continue asserted as long as 
RSTI is zero. 


4. ALE goes to logic zero, 
5. RD and WR goes to logic one, 
6. and BS1-0 = 00. 


When RSTI is asserted, it will not affect the contents of the 
Data, Address Registers or Program Flags, nor will it cause 
any type of internal metastability problems should it be as- 
serted asynchronously. However, in asynchronous applica- 
tions, it will require an additional internal clocked synchroni- 
zation cycle in order to function in accordance with the tim- 
ing specification. 

During reset, it is recommended the BSM be forced to a 
known state. Eight clock cycles after the release of Reset, 
the first bus cycle will appear on the bus. Figure 36 illus- 
trates the sequence. 
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FIGURE 35. External Reset Operation Timing 


BUS ARBITRATION MECHANISM 


Bus arbitration allows a HOST to arbitrate for the asia 
bus. The bus arbitration protocol takes on the following 
form: 


1. An external device asserts a HOLD request to the RGP, 


2. the RGP will assert HOLD acknowledge (HOLDA) after 
the current bus cycle, if no internal Video Refresh cycle is 
pending, 


3. during HOLDA, the RGP will float the Address lines, but 
will continue to drive the remaining outputs in a quiescent 
state. The outputs will reflect the last T-state of the previ- 
ous bus cycle. 


Unlike NMI and INT where acknowledgement is processed 
at the completion of the instruction, the HOLD and HOLDA 
arbitration mechanism is processed on a “per-bus-cycle” 
basis. The HOLD input is sampled on every falling edge of 
phase two. In asynchronous applications, assertion of 
HOLD will cause no internal metastability problems and is 
sampled true whenever the setup time is met according to 
the timing specifications. 


BUS ARBITRATION PRIORITY SCHEME 


Figure 2 illustrates the internal architecture of the RGP. The 
internal bus arbitration scheme of the RGP is based on the 
following: 

1. Video Refresh, 


2. HOLD, HOLDA mechanism 


status Is 
undetermined 
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RGP enters the bus 
‘ {dle state BS1-0 =00 
RD and WR=1 


Start of First 
Bus Cycle 


TL/F/9427-64 


3. Non-Maskable Interrupt (NMI), 
4. Interrupt (INT), : 
5. Non-Drawing and Drawing Bus Cycles. 


Generation of Video refresh must be the highest priority to 
ensure real-time operation. In situations where the RGP has 
relinquished the bus to the HOST, DRREQ can possibly 
serve as a Preempting, or high priority system interrupt in 
order to indicate an impending Video refresh. Because the 
internal RGP bus arbitration mechanism allows Video Re- 
fresh cycles to have highest priority, it is important that a 
similar priority structure is reflected in the structure of the 
BSM. 


RELATIONSHIP OF RGP PHASE AND LOAD CLOCKS 


The RGP is designed to clock under direction of an external 
Two-Phase, and Load Clock (LCK) sources. Both non-over- 
lapping Phase clocks are used to drive the main internal 
control logic block, while the LCLK input synchronizes the 
RGP to the external video data shift rate. Generation of 
DRREQ, Horizontal, Vertical and Plann sync signals are 
based on the LCLK source. 


In order to maintain consistency in the assertion of DRREQ, 
the RGP samples LCLK internally with the Phase input 
clocks. As a result, LCLK must always function at a sub-mul- 
tiple of the phase clock frequency (see AC timing specifica- 
tions), otherwise, generation of DRREQ, and internal me- 
tastability could result from such practice. 
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TL/F/9427-65 
Optimum DP8500 Buffer Bypassing 
Note 1: Capacitor type: Ceramic Note 4: For optimum Buffer performance, the diagram illustrates a recom- 
Note 2: Keep all capacitor lead length as short as possible; use leadiess if mended layout for capacitor and capacitor values. Depending on the system 
possible. parasitic capacitance, actual bypass capacitor values may vary. 


Note 3: Keep logic supply separate from buffer supply. 


0.1 uF DP8S00 0.1 uF 
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Optimum DP8500 Logic Bypassing 
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Absolute Maximum Ratings* 


If Military/Aerospace specified devices are required, 
please contact the National Semiconductor Sales 
Office/Distributors for availability and specifications. 


Storage Temperature Range —65°C to + 150°C 


Supply Voltage (Voc) —0.5V to 7V 
Voltage at Any Pin 
with Respect to GND —-0.5V to Voc + 0.5V 


Package Power Dissipation @ 20 MHz 2.5W @ 25°C 


DC Electrical Characteristics 


Commercial: Ta = 0°C to + 70°C, Voc = 5V +10%, GND = OV 
Military: Ta = —55°C to + 125°C; Voo = 5V +10%, GND = OV 


Thermal Characteristics Pca Package 


Note: Absolute maximum ratings indicate limits beyond 
which permanent damage may occur. Continuous operation 
at these limits is not intended; operation should be limited to 


_those conditions specified under DC Electrical Characteris- 
. tics. 


*Note: These are preliminary specifications. 


Military? 

Symbol Characteristics Conditions Unit 
ee ee a 
Mi P| i 
UN a ea nee ae ner ea OY a OY 
Vor PHI,PH2PinsOnly,MOS | Voc-05| | Voc-05| | V 
Vo.__|MOSCiockLow CL PHY PH2PinsOnly,mos| | 03 | | 03 |v 
Vort__|MOSClockRinging = PHIPH2PinsOny,Mos| -05 | 05 | -05 | 05 | V 
Vor | st = 3m |e || 
Vo | tn = ma | tos | os |v 
in| InputLeakage Curent | Vi=Vinorn =| sf eto] | 10| pa 
loz | TRISTATE LeakageforAt6-A23_ | Vo=VocorGND_ |_| #60] | #75] pA 

| TRLSTATE Leakage for All Other Outputs |Vo=Vocor@nD_— | | #10] | #10 A 
loc1__| Quiescentcurent | PHY,PH2atzomHz = || 80 | | 80 | ma 
lcce__| Supply Curent CL PHY PHZattoowrz =| | 10 | | 16 | ma 
lcca__| SupplyCurrent | PHY PHZat2oMHz =| | 70 | | 70 | ma 
Cn | nputCepacitance | tinatt MHz | S| to | st | 


Symbol Characteristics | Max Units 
05a Thermal Resistance—Ceramic g 
C/W 
Junction to Ambient 
Note: *Estimated . 
+ Preliminary 
5.0V GND 
5.0V 
R2 R3 
TEST 
TEST 
POINT POINT 


TL/F/9427-46 
RGP Output Test Load Circuitry 
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TL/F/9427-47 
RGP Output TRI-STATE Test Load Circuitry 


Note 1:C1 = 50 pF 
R1 = 6k 
R2 = 1.3kn 
R3 = 1.8k0 
Note 2: Connect SW to + 5V for tp,_z and tpz, measurements. 


Note 3: Connect SW to GND for tpy4z and tpz}; measurements. 


DP8500 AC Timing Characteristics 


Voc = 5.0 10%; GND = OV; Commercial Ta = 0°C to 70°C; Military Ta = —55°C to + 125°C; Loading = 50 pF 


Symbol Description Reference | Military | __ Commercial 
} Min | Max | Min | Max 


tCp 
tCh 
tnOVL 


tLCp 


tL.Ch 
tLCl 
tLCKs 
tLCKh 
tDis 
tDih 
tDv 
tDiv 
tADf 
tALv 
tALf 
tALav 
tALiv 
tALaiv 
tAHv 
tAHiv 
tAHf 
tAHav 
tLMEv 
tRMEv 
tFWRv 
tFRDv 
tLMEiv 
tRMEiv 
tFWRiv 
tFRDiv 
tLBv 
tBSEv 
tLBiv 
tBSEiv 
tHALTv 


| 23 | PH1 or PH2 Clock Period RE to Next RE 24s 
| 23 | PH1 or PH2 High Time RE50%toNextFE50% | 20 | | 


WB iain e E 
23 Next PH2 (PH1) RE 50% 

a eee | eT 

Mode 0&3 Mode 1&2 2"tCp 
LCK | LCKHighTime sd Time | LOCK RE 50% toFE50% | RE 50% to FE 50% 

ee eee ae 
| 29 | LCKHighSetupTime _—|_ BeforePHIRESO% | 5 | | 5 
| 29 | LCKHighHoldTime | AfterPHIRESO%® =| 5 | | 5 | 
| 24 | DatainSetupTime | BeforePH2FESO% = || 6 | = | 5 _ | 
| 24 | DatainHoldTime | After PH2FESO% | 10 | =| 10 
| 26 | DataValidTime S| AfterPH2RESO% =| | 43 | 
[26 | Datalwaidtime | AfterPHERESo%® | 6 | | 5 
| 20 | AD15-OBusFloating | AfterPH2RESO% |_| 42 | 
[20 | Address 15-0Vaid | AfterPH2RESo%™ | | «a | 
| 24 | Address 15-OFloat S| AfterPH2RESO% |_| 40 | 
| 24 | Address 15-ovalid | Before ALEFESO% | 2 | = | 8 | 
| 26 | Address 15-Olnvalid | AfterPH2RES0% =| 6 | =| 5 
| 27 | Address15-Olnvalid | After ALEFESO% =| 16 | | 18 
[20 | Address 29-16Vaid | AfterPH2RESO% | | 99 | 
| 24 | Address23-16 invalid | AfterPH2RESO% || 5 | | 
| 20 | Address23-16Floating | AfterPH2RESO% |_| 40 | 
| 24 | Address 16-23Setup | Before ALEFESO% | 2 | | 5 _| 
| 26 | LMEvaid =| After PHIRESO% =| | 43 | 
[28 | RMEVaic ———=«di;CAterP ESO | «| a | 
| 27 | FWRVaid | AfterPHIRESO% | | 48 | 
| 26 | FRDVaid | AfterPHIRESO% =| | 48 | 
[26 | iMetwaid _——+(|-AterPHIRESO” | @ | |e 
[26 | RMEivaic +i _AfterPHTRESOm™ | a | |e 
| 27 | FWRinvaid | After PHIRESO% || 8 | | 8 
| 26 | FRDInvaid | AfterPHYRESO% =| 8 | | 8 
[i valid ——SSS~*dtC er PRES || we | 
| 27 | BSEVaid | fterPHIRESO% =| | | 
[| tfBinvaid ———=S~=*dtCer PH RESOm™ |e | |e _| 
[27_| ascinvaid ———~«dtAerPHIRESOm | 8 | | 8 __ 
| 34 | ~Haltvaid | AterPHIRESO% =| | 42 | 
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DP8500 AC Timing Characteristics 


Voc = 5.0 £10%; GND = OV; Commercial Ta = 0°C to 70°C; Military Tp = —55°C to + 125°C; Loading = 50 pF (Continued) 
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DP8500 AC Timing Characteristics 
Voc = 5.0 +10%; GND = OV; Commercial Ta, = 0°C to 70°C; Military Ta = —55°C to + 125°C; Loading = 50 pF (Continued) 


| Military | Commercial 


Notes: Military Specifications are preliminary. Please contact your National Semiconductor Sales Office or Distributor for availability and specifications. 
The timing specifications listed above are based on a 50% threshold input voltage, with an output threshold of 1.5V. 

Note 1: The column titled Symbol holds the symbolic name of the parameter, as it appears on the timing diagrams. 

Note 2: The column titled Figure holds the figure number(s) of timing diagrams in which the parameter appears. 


Note 3: All parameters (except tLCp and tPWR) are times in nanoseconds. tLCp and tPWR are based on tCP clocks. Usually a number appears either in the 
column titled Max or in the column titled Min, since in most cases, only one of these is of significance. In cases where both are important, both appear. 


Note 4: The column titled Reference gives the name of a reference signal from which the given parameter is measured. The designation RE or FE following a 
signal name indicates the rising edge or falling edge of that signal. A percentage following RE or FE indicates the percent of the total rise or fall of the signal at the 
point from which the parameter being described is measured. 


Note 5: Minimum tRSTw is one tCp. 
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aPICaLr Performance Characteristics 
DP8500 Output Capacitance Derating Curve 


— 
a” 
eC 

~~ 

> 
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data sheet reference point 


loading (pf) 


0 10 20 30 40 50 60 70 80 


Note 1: All derated output values are based on the 50 pF data sheet reference point. 
Note 2: The graph may not be linear outside range shown. 

Note 3: Derated values apply to both the rising and falling edge. 

Note 4: The curve reflects worst case @ 70°C. 

Note 5: This curve applies ONLY to output drives. 
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FIGURE 36. HOLD Timing (Bus Initially idle) 
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FIGURE 37. HOLD Timing (Bus Busy with Drawing Cycle) 
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00S8dqd 


DP8500 


Timing Waveforms (continued) 
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Timing Waveforms (continued) 
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FIGURE 41. Operand Read Cycle Timing 
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Timing Waveforms (Continued) 
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FIGURE 43. BITBLT Source Read Cycle Timing 
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FIGURE 44. BITBLT Destination Write Cycle Timing 
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Timing Waveforms (Continued) 
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FIGURE 46. Line Drawing Read-Modify-Write Cycle 
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FIGURE 47. Mode 3 Video Refresh Cycle Timing (Bus Idle) 
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FIGURE 48. Mode 1 Display Refresh 
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Timing Waveforms (continued) 
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FIGURE 52. Non-Maskable Input Timing 
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FIGURE 53. RESET input Timing 
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Timing Waveforms (continued) 
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FIGURE 54. RESET Output Timing 
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FIGURE 56. BLANK Timing 
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Timing Waveforms (continued) 
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NS32CG16-10/NS32CG16-15 


National © 
Semiconductor 


PRELIMINARY 


NS32CG16-10/NS32CG16-15 | 
High-Performance Printer/Display Processor 


General Description | 


The NS32CG16 is a 32-bit microprocessor in the Series 
32000® family that provides special features for graphics 
applications. It is specifically designed to support page ori- 
ented printing technologies such as Laser, LCS, LED, lon- 
Deposition and InkJet. 

The NS32CG16 provides a 16 Mbyte linear address space 
and a 16-bit external data bus. It also has a 32-bit ALU, an 
eight-byte prefetch queue, and a slave processor interface. 


Block Diagram 


ADD/DATA CONTROLS & STATUS 
DMA PIPELINED | 
CONTROL BUS INTERFACE CONTROL 
PROGRAMMABLE “ 
BUS TIMING 
| CLOCK 
© | GENERATOR | 


DATA 


MICROCODE ROM 


- AND 
CONTROL LOGIC 


The capabilities of the NS32CG16 can be expanded by us- 


ing an external floating point unit which interfaces to the 


NS32CG16 as a slave processor. This combination pro- 
vides optimal support for outline character fonts. 


The NS32CG16’s highly efficient architecture, in addition to 
the built-in capabilities for supporting BITBLT (BIT-aligned 
BLock Transfer) operations and other special graphics func- 
tions, make the device the ideal choice to handle a variety 
of page description languages such as Postscript™’ and 
PCL™, 


REGISTER SET 


OTD GRAPHICS 
CFG LOGIC 
REGISTER ne 


| WORKING 
REGISTERS 


16 PIPELINED 
INSTRUCTION 
DECODER 


PIPELINED 
DISPLACEMENT AND 
IMMEDIATE EXTRACTOR 


32 BIT INTERNAL BUS 


Features 
m Software compatible with the Series 32000 family 
@ 32-bit architecture and implementation 
@ 16 Mbyte linear address space 
@ Special support for imaging applications such as print- 
ers, faxes and scanners 
— 18 graphics instructions 
— Binary compression/expansion capability for font 
storage using RLL encoding 
— Pattern magnification for Epson and HP LaserJet™ 
emulations 
— 6 BITBLT instructions on chip 
— Interface to an external BITBLT processing unit for 
very fast BITBLT operations (optional) 


TL/EE/9424-1 


m Floating point support via the NS32081 or the NS32381 
for outline fonts, scaling and rotation 

m On-chip clock generator 

m Optimal interface to large memory arrays via the 
DP84xx family of DRAM controllers 

m@ Power save mode 

@ High-speed CMOS technology 

@ 68-pin plastic PCC package 
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National | 
Semiconductor 


DP8&510 BITBLT Processing Unit 


General Description Features 

The DP8510 BITBLT Processing Unit (BPU) is a high-per- | ™ Supports all 16 classical BITBLT functions 

formance microCMOS device designed for use in raster m Pipelined data input for high system throughput 
graphics applications. It implements, in high-speed pipelined Flexible architecture allows BPU to be used with a 


01S8dqd 


logic, the data operations which are fundamental to BITBLT state machine or processor 
(BIT boundary Block Transfer) graphics: shifting, masking ™ Multiple BPUs can be used for multiple bitplane/color 
and bitwise logic operations. Under control of external hard- applications 


ware such as a state machine or a general-purpose micro- 
processor, it provides all necessary data path operations, 
easing the implementation of a wide variety of BITBLT sys- 
tems. A number of input pins control the proper data flow in 
the BPU. A simple handshake scheme is used to interface 
the CPU, the BPU and the memory system. 


@ Line drawing support 

= Compatible with static or dynamic RAMs, including 
Video DRAMs 

= Compatible with page mode, nibble mode and static 
column RAMs 


The BPU has two modes, BITBLT and line drawing. The ™ 22-bit to 16-bit barrel shifter 
mode is set by the B/L pin. The line-drawing mode canbe ™ 16-bit data port 

treated as a special case BITBLT with height and width @ 16-word FIFO 

equal to one. | @ 16-bit logic operations 

In order to perform a BITBLT operation, the BPU’s control ™ 20 MHz operation 

register must first be loaded with four parameters: the shift ™ Single +5 volt supply 


number, left and right masks and the function select code,a _™@ All inputs and outputs TTL-compatible 

total of 16 bits. BITBLT can then proceed, as directed by an. m Packaged in a 44-pin PCC (commercial) or 44-pin PGA 
external processor or state machine. It is the responsibility (MIL) 

of the controller to generate appropriate addresses for the =m Single-bit pixel I/O port 

BITBLT, to interface with the frame buffer’s memory control 4 member of National’s Advanced Graphics Chip Set 


circuitry, and to control the BPU itself. ™ microCMOS technology 


Block Diagram 
P 
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PA3_FRD 


ee) eT 
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DP8510 


Connection Diagrams 


44-Pin Plastic Chip Carrier (PCC) Package | 


S2eegeeFrEe Ys 
AIRING 
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N.C. = No Connection 
Order Number DP8510V 


See NS Package Number V44A 


44-Pin Grid Array (PGA) Package 


Description ae Description © ees 


PA2/FWR 
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DQ10 : Dat 
LVCC DQ2. 
DQ11 BGND1 
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DQ13 LGND 
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DQ14 DQ5 
DQ15 DQ6 
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Bottom View 


TL/F/8672-25 


Order Number DP8510U 
See NS Package Number U44A 
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Pin Definitions 
DQ0-DQ15: Data I/O Port, 16 bits wide. This is the main 


PDQn: 


PAO/LME: 


PA1/RME: 


PA2/FWR: 


PA3/FRD: 


data port which is connected to the frame 
buffer. It serves as the 16-bit input to the 
Data Input Latch for both source and destina- 
tion data. When DOE is active (low), this port 
serves as data output. 


Data Output Buffer Enable. A low signal on 
this pin enables the data output buffers of 
DQ0-DQ15. 


Pixel Data I/O Port, 1 bit wide. This |/O port 
is used in the line drawing mode only. It 
serves as the single bit input to the Pixel In- 
put Latch to provide a source bit for line 
drawing. When POE is active (low), this port 
serves as an output for the pixel selected by 
PAO-PAS. 


Pixel Output Buffer Enable. A low signal on 
this pin enables the pixel output buffer. This 
allows the CPU to read back the pixel data/ 
value from the frame buffer in a multiple bit- 
plane system. 


BITBLT or Line Drawing. A low on this pin 
enables the BPU for BITBLT operation. A 
high on this pin sets the BPU to the line draw- 
ing mode. 


Pixel Address 0 or Left Mask Enable. When 
the BPU is in the line drawing mode, this pin 
inputs the least significant pixel address. 
When the BPU is in the BITBLT mode, this 
pin receives the Left Mask Enable, an active 
high signal. This input must be synchronized 


' with respect to the falling edge of PH2. 


Pixel Address 1 or Right Mask Enable. When 
the BPU is in the line drawing mode, this pin 
inputs the second least significant pixel ad- 
dress. When the BPU is in the BITBLT mode, 
this pin receives the Right Mask Enable, an 
active high signal. This input must be syn- 
chronized with respect to the falling edge of 
PH2. 


Pixel Address 2 or FIFO Write control. When 
the BPU is in the line drawing mode, this pin 
inputs the third least significant pixel address. 
When the BPU is in the BITBLT mode, this 
pin is the active high FIFO write control input. 
This input must be synchronized with respect 
to the falling edge of PH2. 


Pixel Address 3 or FIFO Read control. When 
the BPU is in the line drawing mode, this pin 
inputs the most significant pixel address. 
When the BPU is in the BITBLT mode, this 
pin is the active high FIFO read control input. 
This input must be synchronized with respect 
to the falling edge of PH2. 
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BIS: 


TCS: 


PH1: 


PH2: 


CRE: 


DLE: 


Barrel Input Select. This signal controls the 
multiplexer prior to the BPU’s barrel shifter. If 
this signal is high, a wordwise swap is per- 
formed between the two 16-bit inputs to the 
barrel shifter. If this signal is low, no swap is 
performed. Therefore, if this signal is low, the 
BIL register serves as the most-significant 
word to the barrel shifter, with the DiL-Source 
register serving as the least significant word. 
Conversely, when this signal is high, DIL 
Source serves as the most significant input 
word to the barrel shifter, with BIL being the 
least significant word. 


BITBLT Source Enable, enables the BITBLT 
source input data path and controls the latch- 
ing function of the BITBLT source pipeline 
register. BSE should be held low (disabled) 
during the BITBLT destination data read/ 
write cycles. This input must be synchronized 
with respect to the falling edge of PH2. 


TTL Clock Select. This pin should tie to either 
Voc or Ground. A high level on this pin se- 
lects the TTL level clock input. The use of a 
conventional TTL clock, permitted at clock 
frequencies up to 10 MHz, simplifies system 
design. When using TTL clock all references 
to the falling edge of PH2 must be changed 
to the rising edge of PH1. A low level selects 
PH1 and PH2 (MOS level clocks) as the BPU 
clock inputs. 


Phase 1 clock input or the TTL clock input. 
When TCS is set low, this is the PH1 clock 
input, MOS level, maximum clock rate 20 
MHz. When the TCS pin is set high, this is the 
TTL clock input with a 10 MHz maximum rate. 


Phase 2 clock input. MOS level, maximum 
clock rate 20 MHz. When using PH1 as TTL 
clock input (TCS high), PH2 must be tied to 
ground. 


BPU Control Register Enable. A high signal 
on this pin enables the BPU’s Control Regis- 
ter. The data on DQ0-—DQ15 is latched into 
the BPU Control Register on the falling edge 
of the next PH2 clock. CRE must be synchro- 
nized with respect to the valid data and must 
be removed before the rising wis of the 
subsequent PH2 clock. 


Data Latch Enable. A high signal on this pin 
enables the BPU’s data input latch. The data 
on DQ0-DQ'15 is latched into the BPU data 
input latch on the falling edge of the next PH2 
clock. DLE must be synchronized with re- 
spect to the valid data and must be removed 
before the rising edge of the subsequent PH2 
clock. 
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Pin Definitions (Continued) 


PDLE: Pixel port Data Latch Enable. A high signal 
- . - an this pin enables the BPU’s pixel port data 
' input latch. The data bit on PDQn pin is 
latched into the one-bit pixel input data latch 
on the falling edge of the next PH2 clock. 
PDLE must be synchronized with respect to 
the valid pixel data and must be removed be- 
fore the rising ocue of the subsequent PH2 
clock. 


DOS: Data Output Select. DOS selects the data 
: output from either the FIFO (DOS = 1) or the 
BITBLT logic block (DOS = 0). 


RESET: FIFO control Reset. A low signal on this pin 

| resets the BPU’s FIFO read/write control cir- 

cuitry. Data previously stored in the FIFO or 

on-chip latches are unchanged. This pin is 

controlled by the RGP’s RESET line, an 

open-drain |!/O pin. This input must be syn- 

chronized with respect to the falling edge of 

PH2. PA2/FWR and PA3/FRD inputs must 

be low 1 clock cycle prior to and 2 clock cy- 

cles after asserting RESET low in order to 
correctly reset the FIFO counters. 


LVCC: Positive supply for on-chip logic circuits. 5V 
+10% 
LGND: Ground for on-chip logic. 


BVCC0- | Positive supply for output buffers, two pins. 
BVCC1: 5V +10%., 


BGNDO- 
BGND3: Ground for output buffers, four pins. 
BITBLT Fundamentals 


BITBLT, BIT-aligned Block Transfer, is a general operator 
that provides a mechanism to move an arbitrary size rectan- 
gle of an image from one part of the frame buffer to another. 
During the data transfer process a bitwise logical operation 
can be performed between the source and the destination 
data. BITBLT is also called RasterOp: operations on rasters. 
It defines two rectangular areas, source and destination, 
and performs a logical operation (e.g., AND, OR XOR) be- 
tween these two areas and stores the result back to the 
destination. It can be expressed in simple notation as: 


Destination <— Source op Destination. 
Op: AND, OR, XOR, etc. 


FRAME BUFFER ARCHITECTURE 


Generally, there are two kinds of frame buffer architectures: 
PLANE-oriented or PIXEL-oriented. BITBLT takes advan- 
tage of the plane-oriented frame buffer architecture’s attri- 
bute of multiple, adjacent pixels-per-word, facilitating the 
movement of large blocks of data quickly in a frame buffer. 
However, the plane-oriented architecture has one inherent 
problem: the limit of resolution for memory addressing and 
access is the word, rather than the pixel. The BITBLT 
source starting address, the BITBLT destination starting ad- 
dress, the BITBLT width and the BITBLT height are all de- 
fined in pixels. The BITBLT source data block may start and 
end at any bit position of any word, and the destination data 
block also may start and end at any bit position of any word. 
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BIT ALIGNMENT 


Before a logical operation can be performed between the 
source and the destination data, the source data must first 
be bit aligned to the destination data. In Figure 1, the source 
data need to be shifted three bits to the right in order to 
align the first pixel (that is, the pixel at the top left corner) in 
the source data block to the first pixel in the destination data 
block. For maximum performance, this alignment funcHon 
must be implemented with a barrel shifter. 


WORD BOUNDARIES AND DESTINATION MASKS 


Each BITBLT destination scan line may start and end at any 
position in any data word. The neighboring bits (the bits 
sharing the same word address with any words in the desti- 
nation data block, but not a part of the actual BITBLT rec- 
tangle) of the BITBLT destination scan line must remain un- 
changed after the BITBLT. Due to the plane-oriented frame 
buffer architecture, all memory operations. must be word- 
aligned. In order to preserve the neighboring bits surround- 
ing the BITBLT destination block, a left mask is needed for 
all the leftmost data words of the destination block, and a 
right mask is needed for all the rightmost data words of the 
destination data block. Both the left mask and the right 
mask remain the same throughout a given BITBLT opera: 
tion. 


WORD BOUNDARIES 
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FIGURE 1. A 32 by 32 Frame Buffer 


BITBLT Fundamentals (Continued) 


The following example illustrates the bit alignment require- 
ment. In this example, the graphics controller has a 16-bit 
wide data bus. Figure 7 shows a 32 pixel by 32 scan line 
frame buffer which is organized as a long bit stream which 
wraps around every two words (32 bits). Further, the top left 
corner of the frame buffer starts from the lowest word in the 
memory, address O0Ohex. Each word in the memory con- 
tains 16 bits, DQO0-DQ15. The most significant bit of a 
memory word, DQ15, is defined as the first displayed pixel in 
a word. In other words, memory’s DQ15 to DQO correspond 
to pixels 0 to 15 respectively. In this example, BITBLT ad- 
dresses are expressed in terms of pixel number, starting 
(with 0) from the upper-leftmost pixel. The BITBLT source 
starting address is set to 021hex (the second pixel in the 
third word). The BITBLT destination starting address is set 
to 204hex (the fifth pixel in the 33rd word). The BITBLT 
width is set to 013hex (= 19 decimal, corresponding to a 
width of 20 pixels). The BITBLT height is set to OO5hex (=5 
decimal, corresponding to 6 scan lines). 


The left BITBLT mask for the above example is: 
0000,1111,1111,1111 

The right BITBLT mask for the above example is: 
1111,1111,0000,0000 


Note: Zeroes in either the left mask or the right mask indicate the destina- 
tion bits which will not be modified. 


BITBLT DIRECTIONS 


The BITBLT moves a rectangular block of data in a frame 
buffer. For a plane-oriented frame buffer, the BITBLT pro- 
cess can.be considered a subroutine which has two nested 
loops. The loops are preceeded by the BITBLT setup com- 
putations. The outer loop is the BITBLT source and destina- 
tion scan line pixel starting address calculation and line 
count test for completion. The innermost loop is the actual 
BITBLT data movement for a single BITBLT scan line and 
word count test for completion. The length of the innermost 
loop is the word count of the BITBLT width. The length of 
the second loop is equal to the BITBLT’s height (number of 
scan lines involved in a BITBLT): 


BITBLT: calculate BITBLT setup parameters ;once per BITBLT 
such as 
width, height 
bit misalignment (shift number) 
left, right masks 
horizontal, vertical directions 
etc 


OUTERLOOP: calculate source, dest addresses ;once per scanline 
INNERLOOP: move data and increment addresses ;once per word 


UNTIL done horizontally 
UNTIL done vertically 
RETURN (from BITBLT). 


Each loop can be executed in one of two directions: the 
inner loop from left to right or right to left, the outer loop 
from top to bottom (down) or bottom to top (up). 


The ability to move data starting from any corner of the 
BITBLT rectangle is necessary to avoid destroying the 
BITBLT source data as a result of destination writes when 
the source and destination are overlapped (i.e., when they 
share pixels). This situation is routinely encountered while 
panning or scrolling. 
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A determination of the correct execution directions of the 
BITBLT must be performed whenever the source and desti- 
nation rectangles overlap. Any overlap will result in the de- 
struction of source data (from a destination write) if the cor- 
rect vertical direction is not used. Horizontal BITBLT direc- 
tion is of concern only in certain cases of overlap, as will be 
explained below. 


Figure 2 (a) and (b) illustrate two cases of overlap. Here, the 
BITBLT rectangles are three pixels wide by five scan lines 
high; they overlap by a single pixel in (a) and a single col- 
umn of pixels in (6). For purposes of illustration, the BITBLT 
is assumed to be carried out pixel-by-pixel. This convention 
does not affect the conclusions. 


In Figure (a), if the BITBLT is performed in the UP direction 
(bottom-to-top) one of the transfers of the bottom scan line 
of the source will write to the circled pixel of the destination. 
Due to the overlap, this pixel is also part of the uppermost 
scan line of the source rectangle. Thus, data needed later is 
destroyed. Therefore, this BITBLT must be performed in the 
DOWN direction. Another example of this occurs any time 
the screen is moved in a purely vertical direction, as in 
scrolling text. It should be noted that, in both of these cases, 
the choice of horizontal BITBLT direction may be made arbi- 
trarily. 

Figure 2(b) demonstrates a case in which the horizontal 
BITBLT direction may not be chosen arbitrarily. This is an 
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FIGURE 2. Overlapping BITBLT Operations 
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BITBLT Fundamentals (continued) 

instance of purely horizontal movement of data (panning). 
Because the movement from source to destination involves 
data within the same scan line, the incorrect direction of 
movement will overwrite data which will be needed later. In 
this example, the correct direction j is } from right to left. 


BITBLT VARIATIONS 
Some implementations of BITBLT are defined in ene of 


three operands: source, destination and mask/texture. This 


third operand is commonly used in monochrome systems to 
incorporate a stipple pattern into an area. These stipple pat- 
terns provide the appearance of multiple shades of gray in 
single-bit-per-pixel systems, in a manner similar to the ‘half- 
tone’ process used in printing. 

Destination <— Texture op1 Source op2 Destination 


While the BPU is essentially a two-operand device, three- 
operand BITBLT can be implemented quite flexibly and effi- 
ciently by performing the two operations serially. The BPU 
permits the use of any of its sixteen operations for each of 
the two operators shown above as ‘op1’ and ‘op2’. Addition- 
ally, the on-chip FIFO can be used to store the intermediate 
result (the result of opi later used as an operand of op2) 
thus. minimizing the number of memory accesses required. 


ENHANCING BITBLT PERFORMANCE 


There are various ways to enhance BITBLT performance 
(speed). The simplest way is to try to get data in and out of 
the memory system quickly. Most of the bitmapped graphics 
systems utilize DRAMs for both cost and storage density 
reasons. Since the BITBLT data shows strong locality, the 
graphics system can take advantage of certain fast memory 
access modes available to the DRAMs, such as page mode 
access, static column access, etc. The BPU, by means of 
an internal FIFO, can pipe the BITBLT source data to re- 
duce the frequency of switching out of the current page ad- 
dress space, thus maximizing the ability of the system to 
capitalize on the data’s locality. This operation is described 
in the following section. 


PIPING THE BITBLT SOURCE DATA 


When the BITBLT width is more than a word, up to 16 
source data words can be piped into the BPU’s on-chip 
FIFO. At the end of each BITBLT scan line or at the end of 
16 source data words, the controller switches from the 
BITBLT source address space to the BITBLT destination 
address space. When the BITBLT destination data word is 
fetched, two possible memory control sequences can be 
used. One is the modify-write sequence: write the BITBLT 
result back to the destination memory immediately after the 
logical function is executed. The second sequence involves 
storing the BITBLT result back to the BPU’s on-chip FIFO, 
to a maximum of 16 words. Either at the end of each 
BITBLT destination scan line or at the 16th destination data 
word, the BITBLT resultant data is then read out from the 
FIFO and written to the BITBLT destination memory se- 
quentially. 


Summary of the BITBLT Memory 
Control Sequences 


BITBLT MEMORY SEQUENCE | 


0) Load the BPU Control Register with [FS, SN, LM, RM}, 
via the data bus. 
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1) Read in the BITBLT source data up to 16 words (17 
words in certain cases), barrel-shift, then write them into 
the on-chip FIFO. (Only 16 3 barrel-shifted data words can 
be stored.) 


2) Read in the BITBLT destination data while the barrel- 
shifted source data is read out from the on-chip FIFO and 
the selected logical operation is executed, then write 
back to destination. 


3) Go to step 1 until the end of the BITBLT scan line. 
4) Go to step 1 for the remaining BITBLT scan mee 


BITBLT MEMORY SEQUENCE I 


0) Load the BPU Control negates with ee SN, LM, RM} via 
the data bus. 


1) Read in the BITBLT source data up to 16 werda (17 
words in certain cases), barrel-shift, then write them into 
the on-chip FIFO. (Only 16 barrel- shifted data words can 
be stored.) 


2) Read in the BITBLT destination data in sequence, exe- 
cute the selected logical operation and then write the re- 
sult back to the on-chip FIFO, maximum 16 BITBLT data 
words. 


3) Read BITBLT result from the FIFO and write them back 
to the BITBLT destination memory. 


4) Go to. step 1 until the end of the BITBLT scan line. 
5) Go to step 1 for the remaining BITBLT scan lines. 


DP8510 BIT ORDER 


The DP8510’s internal word bit-order is defined from D15 on 
the left to DO on the right. The pixel address bits PAO-PA3 
map directly to this left-to-right bit order convention. For ex- 
ample when PA(3:0) = 0 the rightmost bit of a displayed 
word will be addressed. When PA(3:0) = Ofh, the leftmost 
bit of a displayed word will be addressed (see Figure 3b). 
It should be noted that when pixel data is viewed in the 
frame buffer the order of the bits will be consistent with 
most conventional microprocessor conventions that define 
DO in the LSB bit position or in the rightmost bit location 
(see Figure 3a). 


Block Diagram away 
GENERAL 


Figure 4 illustrates the block diagram of the BPU. It consists 
of several 16-bit latches and multiplexers connected via 16- 
bit data paths to the three major functional blocks: the Bar- 
rel Shifter, the FIFO and the BITBLT Logic Unit. Latches are 
provided for input data from the BITBLT source and destina- 
tion as weil as for control parameters. 


Control parameters consist of 16-bit data which are written 
to the Control Register. This data is used, in conjunction 
with the BPU’s external control pins, to route and modify the 
BITBLT data. 


The primary flow of data during a BITBLT is from the DQO- 
DQ15 pins, through the Master Data Input Latch (DIL-Mas- 
ter) to either the source path or destination path to the 
BITBLT Logic Unit. The output of the logic unit is then rout- 
ed through tri-state buffers back to the DQ0-DQ15 pins. 


Block Diagram Description (Continued) 


ae 16 Bits aie TREN, 
D15 Shift Order ——-———_————_ D0 


Peer TT | Word 0 (Address n) 
PET TT et be TL | Word 1 (Address net) 
e 


e@ 
Pt tT tT tT EE Et Be Bees TL] Word n (Address nn) 


TL/F/8672-~26 


a) Data Representation in Frame Buffer 
(as Seen by the CPU) 
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b) Data Representation in DP8510-Based Display 


FIGURE 3. Frame Buffer vs. Display 
Coordinate Bit-Order Representation 


The source path consists of the Source Data input Latch 
(DiL-Source), the Barrel Input Latch (BIL), the Barrel Shifter 
and the FIFO. The destination path consists of only the Des- 
tination Data Input Latch (DIL-Destination). The destination 
path terminates directly at one input port to the logic unit. 
The logic unit’s other input port receives either the output of 
the FIFO or the output of the Pixel port data Input Latch 
(PIL), as determined by the state of the B/L pin, for BITBLTs 
and line drawing, respectively. 
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The output of the logic unit is routed through multiplexers to 
the output drivers for both DQO0—DQ15 and PDQn. In the 
case of DQ0-—DQ15, the multiplexer permits the FIFO out- 
put, rather than the logic unit output, to be routed to the 
output drivers. The PDQn port, being a single bit port, is 
driven by one of the sixteen output bits of the logic unit, as 
selected by the multiplexer, according to the state of the 
pixel address lines, PAO—PA3. 
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Block Diagram Description (Continued) 
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FIGURE 4. DP8510 BPU Detailed Block Diagram 
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Functional Block Description 


BPU CONTROL REGISTER 


The BPU Control Register consists of 4 fields, each field 
being four bits wide. The BITBLT Function Select field, FS, 
selects one of the 16 BITBLT operations. The barrel Shift 
Number field, SN, controls the number of bit positions shift- 
ed by the barrel shifter. The Left Mask field, LM, sets the 
BITBLT left mask pattern. The Right Mask field, RM, sets 
the BITBLT right mask pattern. 

The SN will be explained in the barrel shifter section. The 
LM and the RM will be detailed in the BITBLT logic block 
section. 


D15 DO 


15 114113112) 111101918)7161514)312111 0 
FS SN LM RM 


FIGURE 5. Control Register Fields 


The content of this register must be loaded prior to the first 
BITBLT operation by asserting CRE (Control Register En- 
able). Only the Function Select, FS, is valid when the BPU is 
in the line-drawing mode. The SN, RM and LM have no 
effect in the line-drawing mode. 


FUNCTION SELECT 


The 16 classical BITBLT functions supported by the BPU 
are described in Table |. 


SHIFT NUMBER 


There are 16 different barrel shift positions controlled by the 
SN of the Control Register. 

Let A=a15...a0,B=b15...b0 andC=c15...c0,AandB 
are the input words, C is the output. An i-bit barrel-shift oper- 
ation on A and B denoted by C= S(i;A,B) is the operation of 
concatenating the most significant (i) bits of word B to the 
least significant (16-i) bits of word A. 
When A = B, the C=S(i;A, B) is equivalent to having i-bit 
circular left shift. The i-bit circular right shift is equivalent to 
the (16-i) bit circular left shift. 

The truth table of the barrel-shift operation S(i;A, B) is 
shown in Table Il. (i = shift number) 


LEFT AND RIGHT MASKS 


The Truth Table for the left and right mask is shown in Table 
ill. Zeroes in the mask patterns indicate bits in the destina- 
tion data which will be preserved during the BITBLT opera- 
tion. 


Both the LM and the RM can be invoked simultaneously. 
This may be necessary where the BITBLT width is less than 
16 pixels. 


TABLE |. BITBLT Function Definitions 


# 3 f2 f1 f0 
0 0 0 0 0 0 
1 0 0 0 1 -S AND -d 
2 0 0 1 0 -s ANDd 
3 0 0 1 1 -S 
4 0 1 0 0 s AND -d 
5 0 1 0 1 -d 
6 0 1 1 0 s XOR d 
7 0 1 1 1 -s OR -d 
8 1 0 0 0 s AND d 
9 1 0 0 1 s XNOR d 
10 1 0 1 0 d 
11 1 0 1 1 -sORd 
12 1 1 0 0 Ss 
13 1 1 0 1 s OR -d 
14 1 1 1 0 sORd 
15 1 1 1 1 1 


Note: d: destination 
s: source 


{3-0: function select code, which selects one of the 
16 BITBLT functions. 


TABLE II. Truth Table for Barrel-Shift Operation S(i;A, B) 
c8 


ci2 ci1  c10 c9 
0 aid ai4 a13 al2~= ati ai0 a9 
1 ai4 ai3 atl2. att ai0 a9 a8 
2 ai3 al2 aati atl0 aQ a8 a7 
3 ai2 ait  at10 aQ a8 a7 a6 
4 ai1 ai0 a9 a8 a7 a6 a5 
5 a10 aQ a8 a7 a6 a5 a4 
6 a9 a8 a7 a6 a5 a4 a3 
7 a8 a7 a6 a5 a4 a3 a2 
8 a7 a6 ad a4 — sso a3 a2 at 
9 a6 a5 a4 a3 a2 al ad 
10 a5 a4 a3 a2 at a0 b15 
11 a4 a3 a2 al a0 b15 b14 
12 a3 a2 al 10) b15 b14 b13 
13 a2 al a0 b15 b14 b1i3 b12 
14 al a0 b15 b14 b13 b1i2 ~~ bili 
15 a0 b15 bi4 b13 b12 b11~ = b10 


c7 c6 c5 c4 


a6 a5 a4 a3 a2 al ad bi5 
a5 a4 a3 a2 al a0 bi5 b14 
a4 a3 a2 ail ad b15 b14 b13 
a3 a2 al al b15 b14 b13 = b12 
a2 al ad b15 b14 %bi3 b12~ Obit 
al ad b15 b14 #b13 »b12 b11 = #42b10 
ad b15 b14 b13 =b12 b11 ~~ »b10 b9 
b15 bi4 bi3 b12 #42b11 ~~ = »b10 b9 b8 
bi4 bi3 bi2 b11~ b10 b9 b8 b7 
bi3 bi2 bii  6b10 b9 b8 b7 b6 
b12 bii  b10 b9 b8 b7 b6 b5 
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Functional Block Description (continued) 
TABLE Ill. Truth Table for Left and Right Mask 


7 | D15 | DO 
LM= 0, L—mask=1111 1111 1111 1111 
LM= 1, L—mask=0111 1111 1111 1111 
LM= 2, L—mask=0011 1111 1111 1111. 
LM= 3, L—mask=0001 141111 1111 1111 
LM= 4, L—mask=0000 1111 1111 1111 
LM= 5, L—mask=0000 0111 1111 1111 
LM= 6, L—mask=0000 0011 1111 1111 
LM= 7, L—mask=0000 0001 1111 1111 
LM= 8, L—mask=0000 0000 1111 1111 
LM= 9, L—mask=0000 0000 0111 1111 
LM=10, L—mask=0000 0000 0011 1111 
LM=11, L—mask=0000 0000 0001 1111 
LM=12, L—mask=0000 0000 0000 1111 
LM=13, L—mask=0000 0000 0000 0111 
LM = 14, L—mask=0000 0000 0000 0011 
LM=15, L—mask=0000 0000 0000 0001 
RM = 0, R—mask= 1000 0000 0000 0000 
RM= 1, R—mask=1100 0000 0000 0000 
RM= 2, R—mask=1110 0000 0000 0000 
RM= 3, R—mask=1111 0000 0000 0000 
RM= 4, R—mask=1111 1000 0000 0000 
RM= 5, R—mask=1111 1100 0000 0000 
RM= 6, R—mask=1111 1110 0000 0000 
RM= 7, R—mask=1111 1111 0000 0000 
RM= 8, R—mask=1111 1111 1000 0000 
RM= 9, R—mask=1111 1111 1100 0000 
RM = 10, R—mask = 1111 1111 1110 0000 
RM = 11, R—mask=1111 1111 1111 0000 
RM = 12, R—mask= 1111 1111 1111 1000 
RM = 18, R—mask= 1111 1111 1111 1100 
RM = 14, R—mask=1111 1111 1111 1110 
RM = 15, R—mask= 1111 1111 1111 1111 


= Enable Logic OP 
0 = Disable Logic OP 


DQ0=-DQ15, FROM DIL=MASTER 


DILS15 


ai5 


DQ15 


DIL=SOURCE 


DILSO 


BARREL INPUT LATCH 


32=TO 16= BIT BARREL SHIFTER 


BARREL-SHIFTER oe 


The function of the barrel shifter is to align the BITBLT 
source data to the destination data. Bit alignment may cross 
word boundaries. - 98 


The barrel shifter in the BPU is implemented as a 32- to 16- 
bit multiplexer. Depending upon the type of the BITBLT and 
the BITBLT length, the necessary source data word(s) are 
fetched into the BPU forming a 32-bit input to the barrel 
shifter. Barrel Input Latch (BIL) stores the first source data 
word fetched and the Source Data Input Latch (DIL-source) 
stores the subsequent source data word. 


A multiplexer precedes the barrel shifter to swap the input 
words if necessary. That is, Barrel Input Select (BIS) causes 
BIL (and DiL-Source) to be routed to the left or right (and 
right or left) half of the barre! shifter. Note that this is a word- 
level swap; it does not affect the ordering of the bits within 
the words. This swap mechanism facilitates the fetching of 
BITBLT data from left-to-right or right-to-left. 


When the BITBLT direction is set from the left to the right 
(the source data words are fetched starting from the left- 
hand side of the BITBLT rectangle), the BIS pin should be 
set low. If the BITBLT direction is set from the right to the 
left, BIS must be set high to exchange the source data se- 
quence. 


Figure 6 depicts the data path in the BPU’s barrel-shifter 
block. 


The Barrel Input Latch (BIL) is the BITBLT source data pipe- 
line register. It is loaded (from DiL-Source) one clock cycle 
after DiL-Source is loaded. Therefore, when the system 
fetches a BITBILT source word from memory (BSE assert- 
ed), this word will be first loaded into DiL-Source on the 
rising edge of a PH1; it will then be transferred to BIL on the 
rising edge of the following PH1. As a result, if the two 
words have been fetched in temporal sequence, the first 
word fetched will be in BIL;.the second word will be in DIL- 
Source. It should be noted that this condition (different 
words in BIL and DiL-Source) will be true for only one clock 
period, since the next rising edge of PH1 will again cause 
BIL to be loaded from DiL-Source. Therefore, the shifted 
result (the output of the barrel shifter) must be written to the 
FIFO on the rising edge of PH1 immediately following the 
PH1 used to load DiL-Source. 


DQ0 


BLO DILS15 


ad b15 bO 


C15=CO 


TO FIFO BLOCK TL/F/8672~7 


FIGURE 6. BPU Barrel-Shifter Block 
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Functional Block Description (Continue) 


Source Data Multiplexing, an Example: 


Figure 7 shows an example of the BPU Barrel Shifter opera- 
tion. Let Word A = aF..a0, Word B = bF..b0, SN = S3..SO 
and BSE = 1. In both cases below, Word B is fetched after 
Word A. 


(FWR) and the FIFO Read (FRD). The use of faster, local- 
ized memory access modes (e.g., page, static column) is 
supported by the BPU via the FIFO, since the FIFO provides 
storage for multiple, barrel-shifted source words. 


Figure 8 depicts the BPU FIFO structure. 


0lS8dd 


FIFO 


The FIFO, 16 bits by 16 words, constitutes the BPU’s on- 
chip storage and is implemented with dual-port RAM. The 
FIFO has separate READ/WRITE controls, the FIFO Write 


Case |. If BIS = 0, 


Word A B 
FEDCBAQOSB76543210FEDCBA9876543210 


mn 
@ 


aFaEaDaCaBaAada8a7a6aba4a3a2ala0 
aEaDaCaBadAaQa8a7abada4asazala0bF 
aDdaCaBadada8a7abada4a3a2ala0bFbE 
aCaBaAa9a8a7aGaSa4asazala0bFbEbD 
aBada9a8a7abada4asazala0bFbEbDbC 
eadada8a7abadia4asazala0bFbEbDbCbB 
aQa8a7aGada4asa2ala0bFbEbDbCbBbA 
a8a7a6ad5a4asazaladbFbEbDbCbhBbAbY 
ava6aSa4asazala0bFbEbDbCbBbAb9bS 
aGada4asaZala0bFbEbDbCbBbAb9b8b7 
a5a4a3a2ala0bFbEbDbCbBbAb9b8b7b6 
a4a3a2Zala0bFbEbDbCbBbAb9Ob8b7bEb5 
a3a2ala0bFbEbDbCbBbAbObSb7bEb5b4 
a2Zala0bFbEbDbCbBbAb9bBb7bEb5b4b3 
alaObFbEbDbCbBbAbObBb7bEb5b4b5b2 
aQ0bFbEbDbCbBbAb9ObSb7bEbSb4b3b2b1 


Hao OW PrP OMNIA PWN FO 


Case Il. If BIS = 1 (The BIS controls the multiplexing before the barrel shifter), 


Word B . A 
FEDCBAS987654353210FEDCBA987654353210 


O bFbEbDDCDBbADObSb7bEb5b4b3b2b1b0 

1 bEDDbCbBbADObDSb7bEbSb4b35b2b1b0aF 

2 bDbCbBbAbObBb7bEbSb4b3b2b1b0aFak 

3 bCbBbAb9b8b7bE6b5b4b3b2b1b0aFaEaD 

4 bBbAbODBb7bHEb5b4b5b2b1b0aFaEaDac 

5 bAb9bSb7b6b5b4b3b2b1b0aFaEaDaCaB 

6 b9b8b7bEb5b4b35b2b1b0aFaEaDaCaBaA 

7 b8b7b6b5b4b5b2b1b0aFaEaDaCaBaAad 

8 b7b6b5b4bS5b2b1Lb0aFaEaDaCaBaAagas 

9 b6b5b4b35b2b1b0aFaEaDaCaBaAa9a8a7 

A b5b4b35b2b1b0aFakaDaCaBaAa9a8a7a6 

B b4b5b2b1b0aFabaDaCaBaAa9a8a7a6a5 

C b3b2b1lb0aFaEaDaCaBaAa9a8a7aba5a4 

D b2blb0aFaEaDaCaBadAa9a8a7a6a5a4a3$ 
E blb0aFaEaDaCaBaAa9a8a7ab6aSa4asa2 
F 


bOaFaEaDaCaBadadSa8a7a6Gadata3sazal 
FIGURE 7 
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Functional Block Description (continue) 


FROM FROM 
‘BARREL SHIFTER BLOCK ‘BITBLT LOGIC UNIT 


CO BLIS 


FIFO 


16 BIT BY 16 WORDS RESET 


FROM BITBLT LOGIC UNIT 


TO DATA OUTPUT BUFFERS 


TL/F/8672-8 
FIGURE 8. BPU FIFO Block 


BITBLT LOGIC CONTROL UNIT 


This block performs the selected BITBLT logical operation 
and the BITBLT destination data masking. ; 


The 16 BITBLT functions can be expressed as: 


f3* ( s* a) + 
f2* ( s* -d ) + 
fl* (-s* dad)+ 


fo * ( -s * ~d ) 


where f0--f3 are the bits of the Function Select field, FS. 


LINE DRAWING MODE (B/L = 1) 
Line drawing mode differs from BITBLT mode in the follow- 
ing respects: 
1) the FIFO is not used in line drawing mode. 
2) the Shift Number parameter is ignored in the line draw- 
ing mode 
3) the Left Mask and Right Mask parameters are ignored 
in the line drawing mode. Instead, masks are generated 
in matched pairs, as needed, according to the current 
state of the Pixel Address lines PAO-PAS. 


For example, if the pixel address (the binary value on 
PAO-3) is 12 decimal (PA3 = PA2 = 1, PA1 = PAO = 
0), then the following masks will be generated: 
MS bit LS bit 
1 L 
LM = 0000 0000 0000 1111 
and RM = 111111111111 1000 
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Since these masks are used simultaneously when the 
destination word is generated (during line drawing 
mode), and since zeroes in the mask indicate destina- 
tion bits that are unaffected (are preserved) by the 
BITBLT operation, the AND of the two masks indicates 
the bit that will be affected by the BITBLT. In this exam- 
ple, bit 3 of the memory word is selected. | 


It is important to note this mapping convention of the 
BPU: the logical pixel (indicated by PAO-—3) associated 
with the current line drawing BITBLT operation corre- 
sponds to a physical bit (indicated by the ones-comple- 
ment of PAO-—3) in the current memory word. That is, 
logical pixel 0 corresponds to DQ15 in any word. 
Logical pixel 1 corresponds to DQ14 in any word, etc. 


4) Conceptually, during line drawing, there is no “source”’, 


in the conventional BITBLT sense of replicating an item 
from one set of storage locations to another set of stor- 
age locations. Rather, the line is “created” bit-by-bit by 
an address-generating algorithm. However, a mecha- 
nism is provided in the BPU to establish a single bit 
which can be logically combined (in the BITBLT Logic 
Unit) with each bit of the destination. This mechanism is 
embodied in the pixel data port and its associated input 
latch (PIL). 


5) the PIL bit is replicated into all 16 bits of the logic unit’s 


source input port in line drawing mode. 


6) a single bit of the logic unit’s output port is selected to 


drive the PDQn output buffer, according to PAO-—PA3, 
in line drawing mode. In BITBLT mode, this multiplexer 
is at TRI-STATE®. 


DESTINATION DATA BUS 16 


FIFO DATA OUTPUT 146 


FO1S FOO DDI5 ppo 
FS 
‘ BITBLT LOGIC 
LM 4 
LME 
Mg MASK CONTROL LOGIC 
RME 


BLIS-BLO 
16 


| TO DATA OUTPUT MUX 
oe. . TL/F/8672-9 
FIGURE 9. BPU Logical Control Unit 


Applications of the DP8510 BPU 


SYSTEM CONTROL SEQUENCE 


Figure 10 illustrates a typical control and memory access 
sequence that might be used to accomplish BITBLT with the 
BPU. In this example, memory access time is sufficiently 
fast to avoid wait states (the BPU is capable of accepting a 
new operand at each clock). All control strobes are as- 
sumed to be generated by the CPU/state machine which is 
serving as the BPU’s controller. 


In this example, a single operation is performed per clock 
cycle for the sake of clarity. Overlap of memory access with 
internal BPU operations would reduce the number of clock 
cycles. 
Clock Action 
1. address for source operand 1 is generated, first 
RAM access is started, data becomes valid at end 
of cycle 
2 source operand 1 is latched in data input latch, 
address for source operand 2 is generated, second 
RAM access is started, data becomes valid at end 
of cycle 
3 source operand 2 is latched in data input latch 
(forming the 32-bit input to the barrel shifter), data 
propagates through the barrel shift logic and 
becomes valid at the input to the FIFO 
4 the barrel-shifted source word is written to the FIFO, 
the BPU’s source path is disabled, the BPU’s 
destination path is enabled 
5 the address for the destination word is generated, 
the destination RAM access is started, data 
becomes valid at end of cycle 
6 destination data is latched into data input latch, 
source word is read from internal FIFO, BITBLT 
logic unit produces resultant word 
7 output buffers are enabled destination write to RAM 
occurs. 


FIFO STRUCTURE, OPERATION AND RESTRICTIONS 


The DP8510 BPU’s FIFO is implemented as a file of sixteen 
16-bit registers. The file has separate ports for read and 
write operations. Register selection is accomplished via a 
pair of shift-register ring counters. One of these is used dur- 
ing FIFO write cycles and is incremented by FIFO Write 
(FWR); the other is used during FIFO read cycles and is 
incremented by FIFO Read (FRD). At RESET, both ring 
counters are set to enable FIFO register 0. Both ring coun- 
ters will wrap around; that is, they will select the FIFO regis- 
ters in the sequence: 
0,1,2, ..., 13, 14, 15, 0, 1, 

During a FIFO write cycle, the FIFO register currently select- 
ed by the WRITE ring counter will be written with the appro- 
priate internally-sourced data (barrel shifter output or logic 
unit output, according to the state of BSE); the WRITE ring 
counter will then be incremented to select the next FIFO 
register. 


The FIFO read operation is implemented in a read-ahead 
manner as follows: the contents of the FIFO register select- 
ed by the FIFO READ ring counter is copied to a master 


latch (of a master-slave pair) during each PH2 regardless of 
the state of FRD; however, the contents of this latch are not 
passed on to the slave (output) latch until the next PH1 
when FRD is active. This FRD also causes the FIFO READ 
ring counter to be incremented to the next FIFO register, 
initiating the read-ahead of that register in anticipation of the 
next FIFO read cycle. 


In general, a given clock cycle can accomplish a simulta- 
neous FIFO read and FIFO write. However, two restrictions 
apply to FIFO operations. First, a simultaneous FIFO read 
and FIFO write should apply to o/fferent FIFO registers; that 
is; the SAME register should NOT be read and written in the 
same cycle. Failure to meet this requirement is not destruc- 
tive, but the result may be contrary to what the user in- 
dends: the output value will be the ‘‘old” contents of the 
currently-selected FIFO register, not the “new” value being 
written to that register. This behavior is a result of the read- 
ahead operation in conjunction with the two-phase master- 
slave output latch. 


The second restriction is a speed-dependent one; it does 
not apply at clock frequencies of 10 MHz and below. At 
higher clock speeds, a two clock-cycle minimum is imposed 
between the FIFO write to a given location and the corre- 
sponding FIFO read of that location. Failure to meet this 
restriction may result in incorrect data being read from the 
FIFO. Consider the case of the FIFO at RESET: if the first 
FIFO write is performed during clock cycle (m), the first 
FIFO read (which is the read of that location) may not take 
place until clock cycle (m+ 2). An equivalent ‘‘FIFO-initially- 
empty” case occurs whenever the (n)th FIFO location is 
written and the (n—1)th FIFO location is read during the 
same clock cycle. This restriction does not impair the ability 
of the BPU to perform consecutive FIFO reads and/or FIFO 


- writes at one-clock intervals. 


BPU USED WITH A CPU OR STATE MACHINE 


_ The BPU places no restrictions on the practical number of 
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planes in a system; rather, systems can be expanded from 
one to any number of planes without compromising BITBLT 
performance. Data movement within each plane is carried 
out in parallel by a dedicated BPU for each plane; as a 
result, a BITBLT can be performed for any number of planes 
in the same period of time as in a single-plane monochrome 
system. 


Systems currently implementing BITBLT completely in soft- 
ware can realize a major performance increase through the 
use of the BPU. In this case, the BPU can be thought of as a 
hardware accelerator for BITBLT. Two examples of the BPU 
in this application are shown in Figures 17 and 72. |In both 
examples, the CPU is responsible for BITBLT preprocess- 
ing, such as the determination of direction, initial BITBLT 
addresses, etc. These examples illustrate the variety of 
cost/complexity versus performance tradeoffs that may be 
made. While Figures 17 and 72 illustrate a single plane ap- 
plication, extension to multiple planes can be achieved by 
implementing a BPU on each bitplane which will result in 
virtually the same performance while delivering a greater 
number of shades or colors. 
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nppreatons of the DP8510 BPU (Continued) 


BITBLT BITBLT = 


SOURCE _. DESTINATION. 
— READ 1 READ 2. READ 1 roe MODIFY WRITE 1 
0 7 8 

20 MHz 

PHI 
ADDRESS 

VALID ; zi hcl a 
ADDRESS OXY st XXX 22) a XKXXXX Sean XX — ——— x | 


ee ae NE 


SRAM,DQO=15_ KKKKKKKKKK odt KX seen oe ae me XXXKXKAD aan ¢ smn {X 


— BPU,DAO-15 — AXXXMXXXXA Lt AXA 2) aR Oe — nae au — ata oe 


BPU, DLE | ; + 

eine | — 7 : 
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a re ee ee 
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oe 
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es ee 
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_ FIGURE 10. Typical BITBLT Timing 
- (Using Fast Static RAM) | 


TL/F/8672-10 
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Applications of the DP8510 BPU (continued) 


CPU BPU 


MEMORY 


MEMORY 
CONTROLLER 


FRAME BUFFER 


ADDITIONAL MEMORY 


CONTROL 


TL/F/8672—12 


FIGURE 11. BPU with CPU using String/Block Move instructions 


Figure 17 presents a solution requiring minimal hardware 
but whose performance may be limited by the CPU’s bus 
cycle time. In this example, the CPU’s block move or string 
move instruction is used to produce the source and destina- 
tion addresses for the BITBLT. The BPU, in conjunction with 
its support logic, intercepts the source data during the read 
cycle. The support logic is also responsible for turning the 
string/block instruction’s write cycles into read-modify- 
writes to allow the BPU to receive the destination data, logi- 
cally combine it with the source data, and return it to the 
destination address. The result is a relatively simple hard- 
ware/software implementation of a single scan line of the 


BITBLT. 
REQ 
‘ 


REQ 


MEMORY 
CONTROLLER 


GRANT STATE MACHINE 
SOURCE/DESTINATION 
ADDRESS COUNTERS 


FRAME BUFFER 


ADDITIONAL MEMORY 


Performance can be further enhanced, at the cost of greater 
hardware complexity, through an implementation as de- 
scribed in Figure 12. As in the previous example, this solu- 
tion performs a single scan line of the BITBLT; however, in 
this case, the BITBLT proceeds without any involvement on 
the CPU’s part. All intermediate addresses within the scan 
line are generated by up-down counters; all bus cycles are 
effected by the state machine’s hardware. Within this 
scheme, further tradeoffs may be made. For example, the 
designer has the option of using the BPU’s source pipeline 
to take advantage of faster memory access modes, such as 
page mode. 


TL/F/8672~13 


FIGURE 12. BPU with CPU and state machine 
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DP8510 


Absolute Maximum Ratings 


if Military/Aerospace specified devices are required, 
please contact the National Semiconductor Sales 
Office/Distributors for availability and specifications. 


Temperature under bias 
0°C to + 70°C (Comm.) 

— 55°C to + 125°C (Mil.) 

Storage Temperature Range 


DC Electrical Characteristics 


~—65°C to + 150°C | 


Power Dissipation @ 20 MHz 


All input or output voltage with 
respect to GND —0.5V to +7V 


0.5W 


' ESD rating is to be determined. 
_ Note: Absolute maximum ratings indicate limits beyond 


which permanent damage may occur. Continuous operation 
at these limits is not intended; operation should be limited to 
those conditions specified under DC Electrical Characteris- 
tics. | 


Ta = 0° to 70° (Comm.), —55°C to + 125°C (Mil.), Voc = 5V +10%, GND = OV 


symbol_| Characteristics |__—=—Conditions. | Min. ~— | Typ_| Max | _Unit 
a ee Ss Bet sa ee Be 
Mw ros low 
Vou | MOSClockHigh | PHI,PH2PinsOnly,MOS | Voc-08 | | Voc+os | V 
Vor___| MOSClockLow | _-PHI,PH2PinsOny,Mos_ | -os | | a3 | 
Voir | _MOSClockRinging | PH1,PH2PinsOnly,Mos | 05 | | 08] 
Vrouw | TTLCiockHigh | TTLClock/PHIPinOny | 25 | | TV 
Vio | TTL GlockLow | ‘TTL Clock/PHIPinony | = | S| S| 
Vow | to =-3mA | ee | 
Vo | | te =ama || ls 
in | Leakage Curent | Vin=Vworve | | | pA 
loz___|_TRISTATE® Leakage | Vo=Vocor@ND ss | | S| St |g 
Icct___| Quiescent Current | PHI,PH2at20MHz | || 12m 
lcce___| Supply Current | PHI,PH2attookHz =| | | tm 
loca | _SupplyCurrent_ | PHI,PH2at2oMHz | | | 28 | mA 
Cw | _InputCapacitance | finatiMHz =| | ft 
Thermal Resistance—Ceramic PGA Package Pf oe 
OJ Junction to Ambient °C/W 
PJc Junction to Case °C/W 


Note: All output test conditions are 50 pF plus one TTL load. 
~  §.0V 


TEST POINT 


TL/F/8672-14 
FIGURE 13. BPU Output Test Load Circuitry 
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-  S.0V GND 


TL/F/8672-15 


FIGURE 14. BPU Output TRI-STATE Test Load Circuitry 
Note 1:C1 = 50 pF 


R1 = 6 kn 
R2 = 1.3 kN 
R3 = 1.8 ko 


Note 2:.Connect SW to +5V for tpLZ and tpZL measurements. 
Note 3: Connect SW to GND for tpHZ and tpZH measurements. 


DP8510 AC Electrical Characteristics* 


em 
Name | Fue: Description Conditions a 
| Min | Max | Min | ax 


fmos_ | | PH1,PH2MOSClock Frequency | TOS=0 | S| 20 MHz | | 20 MHz 
telmos] |__| PH1,PH2MOSClockPeriod S| TCS=0 | 80 | | 8 
tl | | PHITTLClock Frequency = | TOS=1 | MHz || 17 MHz 
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tet] | | PHiTTLClockPeriod | TOS=1 st 88 | CT 888 
tpt | 16 | PHiHighTime(TCS=0) | RESO%toNoxtFESO% | 19 | | 19 | 
tphiH_ | 16 | PHiHighTime(TCS=1) | CRE 1.SVtoNoxtFE tov | 17 | | 17 | 
tphit__| 16 | PHiLowTime(TcS=1) | CFE 1.SVtoNextREDSV | 17 | | 17 | 
tphe__ | 16 | PH2HighTime(TCS=0) | RESO%toNextFESO% | 19 | | 19 | 
tckt_ | 16 | SO%PHIREtoSO™PH2RE | | | 
tok2 50% PH2 RE to 50% PH1 RE aren bs ee Pe 
_tnovt_ | 16 | Non-OveriapTime | PH2to-PH150% | 3 | tS 
tnova__| 16 | Non-OverlapTime | PHIto-PH250% | 3 | | 
tcl_ | 19 | DataSwitchingTime | FromL/BorDos1sv_ | 43 | | 43 
tlesat_ | 17 _| DLE, PDLE,CRE Setup Time(TCS=0) | BeforePH2FESO% | 5 | | 5 | 
tlestr__| 17 __| DLE,PDLE,CRE Setup Time(TCS=1) | BeforePHIRETSV | 5 | | 5 | 
tlenat_ | 17 | DLE,PDLE,CREHoldTime(TCS=0) | AfterPH2FE50% | 5 | | 5 | 
tlehtr | 17 | DLE,PDLE,CREHoldTime(TCS=1) | AfterPHIRE15V | 8 | | 8 | 
tleis | 17 | DLE,PDLE,CRE Invalid | BeforePH2RESO% | 5 | | 5 | 
tdszt_| 17 | DIL.CRDataSetupTime(TCS=0) | BeforePH2FE50% | 5 | | 5 _ 
tanat_| 17 __| DIL,CRDataHoldTime(TCS=0) | AfterPH2FESO% || 8 | | 8 | 
tdstr__| 17 | DiL,CRDataSetupTime(TCS=1) | BeforePHIRETSV | 5 | | 5 | 
tahir_ | 17 __| DiL,CRDataHoldTime(TCS=1) | AfterPHIRE1.5V_ | 10 | | 10 | 
tctlsat_ | 17 | RESETSetupTime(TcS=0) | BeforePH2FESO% | 5 | | 5 
tetinae_ | 17 | RESETHoldTime(TCS=0) | AfterPH2FES0% | 8 | | 8 _| 
tctistir_| 17 | RESETSetupTime(TCS=1) | BeforePHIRET.SV | 5 | | 5 | 
tetintr | 17 | RESETHoldTime(TCS=1) | AfterPHIRE1.5V_ | 10 | | 10 | 
tbseh2f_| 20 | BSEHoldTime(TCS=0) | After PH2FE50% | 5 | | 5 
tbsestr | 20 | BSESetupTime(TCS=1) | BeforePHIRE1.SV | 10 | | 10 
tbsehir | 20 | BSEHoldTime(TCS=1) | AfterPHIRET.SV | 5 | 8 
tbishat_| 20 | BISHoldTime(TCS=0) | After PH2FESO% | 6 | | 5 
tbissir_| 20 _| BIS Setup Time (TCS=1) | BeforePHIRE1SV | 15 | | 15 | 
tbishtr_| 20 | BISHoldTime(TcS=1) | AfterPHIRE1.SV | 5 | |S 
tpdnir_ | 21 | ValidPixelData | AfterPHIRESO% | | a7 | S| 
tdpzH_ | 18 | DataTRLSTATEtoActiveHigh | AfterDOE,POEFE15v | | 25 | | 30 
tdpzL_| 18 | DataTRLSTATEtoActiveLow | AfterDOE,POEFE1S5V | | 25 | | 30 
tpHZ_| 18 | DataActiveHightoTRI-STATE | AfterDOE,POERE15V | | 25_ | | 30 
tdpLZ_ | 18 | DataActiveLowtoTRI-STATE | AfterDOE,POERE15V | | 25 | | __30 
tdqir_ | 22 | Valid OutputData(DOS=0) | AfterPHIRESO% | | 40] 
tdgir_ | 22 | Valid OutputData(DOS=1) | After PHIRESO% | | 30] | 
tviwr | 23 | ValidFiFOWrite(FWR)t | FromDle | tte «| tte | tte | tte 
tvird | 23 | Valid FIFORead (FAD)? | FromvalidFWR | ate | te 


*Conditions 
Comm.: Ta = 0° to +70°C, Voc = 5V + 10%, GND = OV 
Mil.: Ta = —55° to + 125°C, Voc = 5V + 10%, GND = OV 
All values are expressed in nanoseconds unless otherwise specified. ; 
**Military Specifications are preliminary. Please contact your National Semiconductor Sales Office or Distributor for availability and specifications. 
TTo insure that valid source data is written to the FIFO, FWR must occur one PH1 cycle time after DLE. 
ttTo ensure that valid data is read from the FIFO, there must be at least two PH1 periods between FIFO write and FIFO read cycles when the FIFO read and write 
counters are equal. Please consult additional criteria in applications section of this datasheet. 
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Timing Diagrams 
DEFINITIONS 


All the timing specifications given in this section refer to 50% of the leading or trailing edges of the appropriate clock phase and 
0.5V or 3.0V on the appropriate signal as illustrated in the following figures, unless specifically stated otherwise. 


DP8510 


PHI 


SIG 


PH2 


TL/F/8672—16 
Note: Data is measured at 1.5V to 50% points of PH1 and PH2. 


FIGURE 15. Timing Specification Standard 
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*TTL Clock Specification ; 
FIGURE 16. Two-Phase MOS/Single-Phase TTL Clock Timing Specification 
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FIGURE 17. Input Data Timing Specification 
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Timing Diagrams (Continued) 
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FIGURE 18. Output Enable Timing Specification 


VALID INPUT 


B/L, DOS 


nope noon) 
PDOn : VALID DATA | 


Voi 
TL/F/8672-—20 


FIGURE 19. Mode Control Timing Specification 
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FIGURE 20. Control Signal Timing Specification 
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Timing Diagrams (continued) 


PH1 


PH2 


VALID PIXEL DATA 
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DLE 3.0V 
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FIGURE 21. Pixel Read Timing Specification | 
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FIGURE 22. FIFO/DIL-Dest Data Read Timing Specification 
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Timing Diagrams (Continued) 
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FIGURE 23. DLE & FWR Timing Relationships 
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DP8511 


National 
Semiconductor 


DP8511 BITBLT Processing Unit (BPU) 


General Description 


The DP8511 BITBLT Processing Unit (BPU), a member of 
National Semiconductor’s Advanced Graphics Chip. Set 
(AGCS), is a high performance microCMOS device intended 


for use in raster graphics applications. Specifically designed . 


to complement the DP8500 Raster Graphics Processor 
(RGP), the BPU performs data operations that are elemen- 
tary to BITBLT (BIT boundary Block Transfer) graphics: 
Shift, mask, and bitwise logical manipulation of memory. Un- 
der the control of the RGP, the BPU performs the necessary 
BITBLT data path operations at pipelined hardware speeds. 
A simple set of control lines interfaces the BPU to the RGP 
and to the system memory. 


The BPU has two modes of operation: BITBLT and Line 
Drawing. BITBLT performs shift and logical operations on 


blocks of 16-bit data words. Line drawing performs similar. 


operations on single-bit pixel data by utilizing a single bit 
pixel port (PDn). This port allows data read and read-modify- 
write operations on single pixels across a number of bit- 
planes, giving access to pixel depth. The BPU provides both 
pixel level processing commonly used in image processing 
applications and extremely fast planar operations used 
most frequently in color graphics. 


The BPU’s operation is controlled by the values loaded to 
the Control Register (CR) and the Function Select Register 
(FSR). This dual register configuration of the DP8511 allows 
for high throughput in multi-plane systems that incorporate a 
BPU per piane. This performance advantage is achieved by 
allowing the flexibility of changing the FSR’s contents inde- 


Connection Diagrams 


pendent of the CR, so that multiple bitplanes can be updat- 
ed simultaneously while each BPU performs different logical 
operations on its own destination data. 


Features 

. interfaces directly to the DP8500 Raster Graphics 
Processor or any general purpose controller 

@ 20 MHz operation 

m™ Supports all 16 classical BITBLT functions 

Pipelined data input for high system throughput 

m Provides performance independent of the number of 
bitplanes 

Line Drawing support 

m™ Compatible with static, dynamic RAMs, and Video 
RAMs 

m™ Compatible with page mode, nibble mode and static 
column RAMs 

@ 32-bit to 16-bit barrel shifter 

w 16-bit data port, single bit pixel port 

m 16-word FIFO 

16-bit logic operations 

@ Single +5V supply 

@ All inputs and outputs TTL compatible 

2 micron microCMOS technology | 

@ Packaged in a 44-pin PCC (commercial) or 44-pin PGA 
(MIL) 


44-Pin Plastic Chip Carrier (PCC) 


~~ ld kal 
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N.C. = No Connection 
Top View 
Order Number DP8511V 
See NS Package Number V44A 
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Connection Diagrams (Continued) 
44-Pin Grid Array (PGA) Package 


Description _s Description pa 
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Bottom View 
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D3 | BVCC1 
BVCCO | opt : 
D2 3 LGND 
BGNDO  BGND2 @@@®@ : 
D1 D10 
6 4 Q@@O@®@ A 
PDQn D8 | 8 7 6 5 4 3 2 1 
POE ) N.C. TL/F/9337~20 
L/B | Order Number DP8511U 
BO/LME | See NS Package Number U44A 
B1/RME 
Pin Descriptions 
INPUTS B2/FWR: Bit Select 2/FIFO Write Control. In the line draw- 
DOE: Data Output Buffer Enable. When asserted low, ing mode, this pin serves as the second most 
this pin enables the output buffers on DO—D15. significant pixel address bit. When in the BITBLT 
This input is generally driven by memory control mode, this pin is used as the active high FIFO 
logic. Write control input. This input must be synchro- 
POE: Pixel Output Buffer Enable. Enables the Pixel ee 
Output Buffer when active (low). This pins allows P y y be 
for reading pixel data from the frame buffer in a B3/FRD: Bit Select 3/FIFO Read Control. In the line draw- 
multiple bitplane system. This signal is generally ing mode, this pin serves as the most significant 


pin is used as the active high FIFO Read input. 
This input must be synchronized with respect to 
the falling edge of PH2. This pin is normally driv- 
en by the RGP. 


BSE: BITBLT Source Enable. This pin enables the 
BITBLT source input data path and controls the 
latching function of the BITBLT source pipeline 
register. BSE is sampled and latched on the fall- 
ing edge of PH2 clock. The level on BSE must be 
asserted one clock cycle prior to loading data to 


L/B: Line Drawing or BITBLT. A high on this input en- 
ables the BPU for line drawing mode. A low en- 
ables the BITBLT mode. This input is normally 
driven by the RGP. 


BO/LME: Bit Select 0/Left Mask Enable. In the line draw- 
ing mode, this pin serves as the least significant 
pixel address bit. When in the BITBLT mode, this 
pin is used as the active high Left Mask Enable 
input. This input must be synchronized with re- 


sper iio tie lallligiedge Ober, Ths Bp ene the Data Input Latch (DIL), and must remain con- 


mally driven by the RGP. 
stant during the subsequent data load cycle to 
B1/RME: Bit Select 1/Right Mask Enable. In the line draw- ensure data integrity. This input is driven by the 


ing mode, this pin serves as the second least - B@P. 


significant pixel address bit. When in the BITBLT ee a 
eos TCS: TTL Clock Select. This pin is tied to either Voc or 
MOS) Ne pin sle Used: a8 tne active nigh rugit GND. A high level on this pin selects the TTL 


Mess nate pubs Tie Oput must be syncnle: level clock input mode which uses a convention- 


nized with respect to the falling edge of PH2. Rea 
This pin is nornety driven by the Ace. al TTL level clock of up to 17 MHz, simplifying 
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Pin Descriptions (Continued) 


TCS: the system design. When using TTL clock 

(Continued) mode all references to the falling edge of PH2 
must be changed to the rising edge of PH1. A 
low level selects a 2 phase MOS clock mode 
which uses PH1 and PH2 as clock inputs. 


Phase 1 clock input/TTL clock input. When 
TCS is low, this input serves as the Phase 1 
MOS clock input with a maximum clock rate of 
20 MHz. When TCS is high, this input is the TTL 
clock input with a maximum rate of 17 MHz. 


Phase 2 clock input. This pin is the Phase 2 
MOS clock input with a maximum rate of 20 
MHz. When using a TTL clock, this input must 
be tied to ground. 


BPU Control Register Enable. The data on DO- 
D15 is latched into the Control Register on the 
rising edge of CRE. This input is driven by mem- 
ory decode logic. 

Data Latch Enable. A low on this pin enables 
the BPU’s Data Input Latch (DIL). The data on 
DO-D15 is latched into the DIL on the falling 
edge of the PH2 clock. DLE must be synchro- 
nized with respect to the valid data and must be 
removed before the rising edge of the subse- 
quent PH2 clock. This input is generally driven 
by memory control logic. 


BPU Function Select Register Enable. The data 
on DO-D15 is latched into the Function Select 
Register on the rising edge of FSE. This input 
generally is driven by memory decode logic. 


Pixel port Data Latch Enable. A low level on this 
pin enables the BPU’s pixel port data input 

‘latch. The data on the PDn pin is latched into 
the one-bit Pixel Input Latch on the falling edge 
of the PH2 clock. PDLE must be synchronized 
with respect to the valid pixel data and must be 
removed before the rising edge of the subse- 
quent PH2 clock. 


Data Output Select: DOS selects the data out- 
put from either the FIFO (DOS = 1) or the 
BITBLT logic unit (DOS = 0). This input should 
be grounded when the BPU is controlled by an 
RGP. 


FIFO control Reset. A low on this pin resets the 

- BPU’s FIFO read/write control circuitry. Data 
previously stored in the FIFO or on-chip latches 
is unchanged. This input must be synchronized 
with respect to the falling edge of PH2. 
PA2/FWR and PA3/FRD inputs must be low 1 
clock cycle prior to and 2 clock cycles after as- 
serting RESET. This pin is controlled by either 
the RGP’s RSTO line or a TTL level reset sig- 
nal. 


INPUTS/OUTPUTS 


DO-D15: 16-bit Bidirectional Data Port. This port serves 
as the input to the Data Input Latch, Control 
Register, and to the Function Select Register. 
When DOE is active (low), this Pott serves as a 
16-bit output buffer. 


PH2: | 


1 


oO 
r 
m 


v0 
oO 
r 
m 


DOS: 
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Pixel Data Port, a single bit bidirectional port. 


PDn: 
| This I/O port is used in the line drawing mode (L/ 
= 1). It serves as the input to the Pixel Input 
Latch, providing a source bit for line drawing. 
When POE is active (low), this port serves as an 
output for the pixel previously latched in the Pixel 
Output Latch (POL). 
SUPPLIES 
LVCC: Positive supply for on-chip logic. 5 Vpop +10%. 
LGND: Ground for on-chip logic. 
BVCCO- 
BVCC1: Positive supply for output buffers, two pins total. 
5 Voc £10%. 
BGNDO- 
BGND3: Ground for output buffers, four pins total. 


BITBLT Fundamentals 


BITBLT, BiT-aligned Block Transfer, is a general operator 
that provides a mechanism to move an arbitrary size rectan- 
gle of an image from one part of the frame buffer to another. 
During the data transfer process a bitwise logical operation. 
can be performed between the source and the destination 
data. BITBLT is also called RasterOp: operations on rasters. 
It defines two rectangular areas, source and destination, 
and performs a logical operation (e.g., AND, OR XOR) be- 
tween these two areas and stores the result back to the 
destination. It can be expressed in simple notation as: 


Destination <— Source op Destination. 
op: AND, OR, XOR, etc. 


FRAME BUFFER ARCHITECTURE 


Generally, there are two kinds of frame buffer architectures: 
PLANE-oriented or PIXEL-oriented. BITBLT takes advan- 
tage of the plane-oriented frame buffer architecture’s attri- 
bute of multiple, adjacent pixels-per-word, facilitating the 
movement of large blocks of data quickly in a frame buffer. 


- However, the plane-oriented architecture has one inherent 


problem: the limit of resolution for memory addressing and 
access is the word, rather than the pixel. The BITBLT 
source starting address, the BITBLT destination starting ad- 
dress, the BITBLT width and the BITBLT height are all de- 
fined in pixels. The BITBLT source data block may start and 
end at any bit position of any word, and the destination data 
block also may start and end at any bit position of any word. 


BIT ALIGNMENT 


Before a logical operation can be performed between the 
source and the destination data, the source data must first 
be bit aligned to the destination data. In Figure 1, the source 
data need to be shifted three bits to the right in order to 
align the first pixel (that is, the pixel at the top left corner) in 
the source data block to the first pixel in the destination data 
block. For maximum performance, this alignment function 
must be implemented with a barrel shifter. . 


WORD BOUNDARIES AND DESTINATION MASKS 


Each BITBLT destination scan line may start and end at any 
position in any data word. The neighboring bits (the bits 
sharing the same word address with any words. in the desti- 
nation data block, but not a part of the actual BITBLT rec- 
tangle) of the BITBLT destination scan line must remain un- 


BITBLT Fundamentals (Continued) 


changed after the BITBLT. Due to the plane-oriented frame 
buffer architecture, all memory operations must be word- 
aligned. In order to preserve the neighboring bits surround- 
ing the BITBLT destination block, a left mask is needed for 
all the leftmost data words of the destination block, and a 
right mask is needed for all the rightmost data words of the 
destination data block. Both the left mask and the right 
mask remain the same throughout a given BITBLT opera- 
tion. 


WORD BOUNDARIES 


—O123456789ABCDEF0123456789ABCDEF 


02 SSSSSSSSSSSSSSSSSSSS 
04 SSSSSSSSSSSSSSSSSSSS 
06 SSSSSSSSSSSSSSSSSSSS 
08 SSSSSSSSSSSSSSSSSSSS 
OA SSSSSSSSSSSSSSSSSSSS 
oc SSSSSSSSSSSSSSSSSSSS 


PIXEL 
NUMBER 


20 -DDDDDDDDDDDDDDDDDODDD 
22 DDDDDDDDDDDDDDDDDDDD 
24 DDDDDDDDDDDODDDDDDDDD 
26 DDDDDDDDDDDDDDDDDODDD 
28 DDDDDDDODDDDDDDDDDODD 
2A DDDDDDDDDDDDDDDDDDDD 


[ MEMORY ADDRESS 
TL/F/9337—17 


FIGURE 1. A 32 by 32 Frame Buffer 


The following example illustrates the bit alignment require- 
ment. In this example, the graphics controller has a 16-bit 
wide data bus. Figure 7 shows a 32 pixel by 32 scan line 
frame buffer which is organized as a long bit stream which 
wraps around every two words (32 bits). Further, the top left 
corner of the frame buffer starts from the lowest word in the 
memory, address O0O0Ohex. Each word in the memory con- 
tains 16 bits, DQ0-DQ15. The most significant bit of a 
memory word, DQ0, is defined as the first displayed pixel in 
a word. In other words, memory’s DQO to DQ15 correspond 
to pixels 0 to 15 respectively. In this example, BITBLT ad- 
dresses are expressed in terms of pixel number, starting 
(with 0) from the upper-leftmost pixel. The BITBLT source 
starting address is set to 021hex (the second pixel in the 
third word). The BITBLT destination starting address is set 


1-87 


to 204hex (the fifth pixel in the 33rd word). The BITBLT 
width is set to 013hex (=19 decimal, corresponding to a 
width of 20 pixels). The BITBLT height is set to OO5hex (= 5 
decimal, corresponding to 6 scan lines). 


The left BITBLT mask for the above example is: 
0000,1111,1111,1111 
The right BITBLT mask for the above example is: 


1111,1111,0000,0000 


Note: Zeroes in either the left mask or the right mask indicate the destina- 
tion bits which will not be modified. 


BITBLT DIRECTIONS 


The BITBLT moves a rectangular block of data in a frame 
buffer. For a plane-oriented frame buffer, the BITBLT pro- 
cess can be considered a subroutine which has two nested 
loops. The loops are preceded by the BITBLT setup compu- 
tations. The outer loop is the BITBLT source and destination 
scan line pixel starting address calculation and line count 
test for completion. The innermost loop is the actual BITBLT 
data movement for a single BITBLT scan line and word 
count test for completion. The length of the innermost loop 
is the word count of the BITBLT width. The length of the 
second loop is equal to the BITBLT’s height (number of 
scan lines involved in a BITBLT): 


BITBLT: calculate BITBLT setup parameters ;once per BITBLT 
such as 
width, height 
bit misalignment (shift number) 
left, right masks 
horizontal, vertical directions 
etc 


OUTERLOOP: calculate source, dest addresses ;once per scanline 
INNERLOOP: move data and increment addresses ;once per word 


UNTIL done horizontally 
UNTIL done vertically 
RETURN (from BITBLT). 


Each loop can be executed in one of two directions: the 
inner loop from left to right or right to left, the outer loop 
from top to bottom (down) or bottom to top (up). 


The ability to move data starting from any corner of the 
BITBLT rectangle is necessary to avoid destroying the 
BITBLT source data as a result of destination writes when 
the source and destination are overlapped (i.e., when they 
share pixels). This situation is routinely encountered while 
panning or scrolling. 


A determination of the correct execution directions of the 
BITBLT must be performed whenever the source and desti- 
nation rectangles overlap. Any overlap will result in the de- 
struction of source data (from a destination write) if the cor- 
rect vertical direction is not used. Horizontal BITBLT direc- 
tion is of concern only in certain cases of overlap, as will be 
explained below. 


Figure 2 (a) and (b) illustrate two cases of overlap. Here, the 
BITBLT rectangles are three pixels wide by five scan lines 
high; they overlap by a single pixel in (a) and a single col- 
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BITBLT Fundamentals (continue) 


umn of pixels in (6). For purposes of illustration, the BITBLT 
is assumed to be carried out pixel-by-pixel. This convention 
does not affect the conclusions. 


In Figure 2(a), if the BITBLT is performed in the UP direction 
(bottom-to-top) one of the transfers of the bottom scan line 
of the source will write to the circled pixel of the destination. 
Due to the overlap, this pixel is also part of the uppermost 
scan line of the source rectangle. Thus, data needed later is 
destroyed. Therefore, this BITBLT must be performed in the 
DOWN direction. Another example of this occurs any time 
the screen is moved in a purely vertical direction, as in 
scrolling text. It should be noted that, in both of these cases, 
the choice of horizontal BITBLT direction may be made arbi- 
trarily. 
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(b) TL/F/9337-19 
FIGURE 2. Overlapping BITBLT Operations 


Figure 2(b) demonstrates a case in which the horizontal 
BITBLT direction may not be chosen arbitrarily. This is an 
instance of purely horizontal movement of data (panning). 
Because the movement from source to destination involves 
data within the same. scan line, the incorrect direction of 
movement will overwrite data which will be needed later. In 
this example, the coreg t direction is from right to left. 


BITBLT VARIATIONS | 


Some implementations of BITBLT are defined in terms of 
three operands: source, destination and mask/texture. This 


third operand is commonly used in monochrome systems to 
incorporate a stipple pattern into an area. These stipple pat- 
terns provide the appearance of multiple shades of gray in 
single-bit-per-pixel systems, in a manner similar to the ‘half- 
tone’ process used in printing. 


Destination <— Texture op1 Source op2 Destination 


While the BPU is essentially a two-operand device, three- 
operand BITBLT can be implemented quite flexibly and effi- 
ciently by performing the two operations serially. The BPU 
permits the use of any of its sixteen operations for each of 
the two operators shown above as ‘op1’ and ‘op2’. Addition- 
ally, the on-chip FIFO can be used to store the intermediate 
result (the result of op1 later used as an operand of op2) 
thus minimizing the number of memory accesses required. 


ENHANCING BITBLT PERFORMANCE 


There are various ways to enhance BITBLT performance 
(speed). The simplest way is to try to get data in and out of 
the memory system quickly. Most of the bitmapped graphics 
systems utilize DRAMs for both cost and storage density 
reasons. Since the BITBLT data shows strong locality, the 
graphics system can take advantage of certain fast memory 
access modes available to the DRAMs, such as page mode 
access, static column access, etc. The BPU, by means of 
an internal FIFO, can pipe the BITBLT source data to re- 
duce the frequency of switching out of the current page ad- 
dress space, thus maximizing the ability of the system to 
capitalize on the data’s locality. This operation is described 
in the following section. 


PIPING THE BITBLT SOURCE DATA 
When the BITBLT width is more than a word, up to 16 


_ source data words can be piped into the BPU’s on-chip 
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FIFO. At the end of each BITBLT scan line or at the end of 
16 source data words, the controller switches from the 
BITBLT source address space to the BITBLT destination 
address space. When the BITBLT destination data word ‘is 
fetched, two possible memory control sequences can be 
used. One is the modify-write sequence: write the BITBLT 
result back to the destination memory immediately after the 
logical function is executed. The second sequence involves 
storing the BITBLT result back to the BPU’s on-chip FIFO, 
to a maximum of 16 words. Either at the end of each 
BITBLT destination scan line or at the 16th destination data 
word, the BITBLT resultant data is then read out from the 
FIFO and written to the BITBLT destination memory se- 
quentially. 


Summary of the BITBLT Memory. 
Control Sequences 


BITBLT MEMORY SEQUENCE | 

1) Load the BPU Control Register with [BIS, SN, LM, RM], 
via the data bus. 

2) Load the BFU FS register with function select code. 

3) Read in the BITBLT source data up to 16 words (17 
words in certain cases), barrel-shift, then write them into 
the on-chip FIFO. (Only 16 barrel-shifted data words can 
be stored.) 

4) Read in the BITBLT destination data while the barrel- 
shifted source data is read out from the on-chip FIFO and 
the selected logical operation is executed, then write 
back to destination. 


Summary of the BITBLT Memory Control Sequences (continueg) 


5) Go to step 3 until the end of the BITBLT scan line. 
6) Go to step 3 for the remaining BITBLT scan lines. 


BITBLT MEMORY SEQUENCE II 

1) Load the BPU Control Register with [BIS, SN, LM, RM] 
via the data bus. 

2) Load BPU FS register with function select code. 

3) Read in the BITBLT source data up to 16 words (17 
words in certain cases), barrel-shift, then write them into 
the on-chip FIFO. (Only 16 barrel-shifted data words can 
be stored.) 


4) Read in the BITBLT destination data in sequence, exe- 
cute the selected logical operation and then write the re- 
sult back to the on-chip FIFO, maximum 16 BITBLT data 
words. 


5) Read BITBLT result from the FIFO and write them back 
to the BITBLT destination memory. 


6) Go to step 3 until the end of the BITBLT scan line. 
7) Go to step 3 for the remaining BITBLT scan lines. 


DP8511 BIT ORDER 


The DP8511’s bit order was defined so it reflects the se- 
quence in which pixel data is viewed in a downward facing 


Cartesian Coordinate display system (see Figure 3b ). \n this 


type of display configuration, the pixel address increases as 
one scans from left-to-right as well as from the top to the 
bottom of the display. When addressing individual pixels 
within a given 16-bit word, a 4-bit pixel address value is 
assigned to each pixel, or bit, of the word and the address 
value increases from the leftmost pixel in the word to the 
right. This allows for the formation of a uniform pixel ad- 
dress space simply by concatenating the pixel address (4 
bits) to the LSB of the word address. 


The DP8511’s internal word bit-order is defined to be con- 
sistent with the order in which pixel data is displayed on the 
screen (i.e., from DO on the left to D15 on the right). This 
convention provides logical continuity when observing how 
data is manipulated inside the DP8511 and establishes the 
relationship to the display pixel addressing scheme. The pix- 
el address bits BO-B3 map directly to this left-to-right bit 
order convention. 


It should be noted that when data is viewed in the frame 
buffer the order of the pixel bits will be consistent with most 
conventional microprocessor conventions that define DO in 
the LSB bit position or in the rightmost bit location (see 
Figure 3a). 


16 Bits 
Shift Order 
D15 DO 
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Word 0 (Address n) 
Word 1 (Address n#1) 


a) Data Representation in Frame Buffer 
(as Seen by the CPU) 
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b) Data Representation in DP851 1-Based Display 
(Downward Facing Cartesian Coordinate System) 


FIGURE 3. Frame Buffer vs Display Coordinate Bit-Order Representation 
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Functional Block Description 


BPU CONTROL REGISTER 


The BPU Control Register (CR) is a 13- bit register consist- 
ing of three 4-bit fields and one single-bit field. The Right 
Mask field, RM, controls the BITBLT right mask pattern. The 
Left Mask, LM, controls the BITBLT left mask pattern. The 
barrel Shift Number, SN, determines the number of bit posi- 
tions shifted by the Barrel Shifter. The BIS bit controls the 
organization of two 16-bit words at the input to the barrel 
shifter. 


FIGURE 4. Control ce Fields 


The CR is loaded by presenting the appropriate data at the 
Data Port (D0-—D15) and then asserting a low level on the 
CRE (Control Register Enable) input. Data is latched into 
the register on the rising edge of CRE. 


When the DP8511 is controlled by the RGP, the CR is treat- 
ed as a memory mapped write only register and resides in 
the RGP’s memory space. The address of this register is 
determined by the RGP’s BPUB register (more information 
can be found in the RGP Programmer’s Reference Manual). 
The CR Register contents are not affected when the BPU is 
in the line drawing mode (L/B = 1). 


Left and Right Masks 


The Left and Right mask fields are used to determine which 
portion of the destination data word to preserve during a 
BITBLT operation. Masks are used when a destination start- 
ing or ending address falls at a pixel address that is not on a 


word boundary. Since data is accessed and modified in 


16-bit word quantities, neighboring bits on both sides of the 
destination data block must be protected or they risk being 
corrupted when the destination data is logically combined 
with the source data. The masks isolate the logic operation 
to only those bits that are enabled with a one in the mask 


fields (of course, RME and/or LME must be asserted con- | 


currently). The truth table for the left and right mask patterns 
is shown in Table I. 


The RM and LM values are automatically determined by the 
RGP during BITBLT and are based on the starting address 
of the destination data block specified in the DSAD register. 


Shift Number 


There are 16 different barrel shift operations controlled by 
the SN of the Control Register. Let A = a15...a0,B = 


1-90 


b1i5...b0andC = c15...c0, Aand B are the input words, 
C is the output. An i-bit barrei-shift operation on A and B 
denoted by C = S(i; A, B) is the operation of concatenating 
the least significant (i) bits of word B to the most significant 
(16-i) bits of word A. 

When A = B, the C = Si(i; A, B) is equivalent to having an 
i-bit circular left shift. The i-bit circular right shift is equivalent 
to the (16-i) bit circular left shift. 


The truth table of the barrel-shift operation S(i, A, B) is 
shown in Table Ill. The shift number is represented by ‘‘i’”’ 
AO-A15 corresponds to the most significant input word of 
the barrel shifter, and BO-B15 corresponds to the least sig- 
nificant input word of the barrel shifter. CO-C15 is the barrel 
shifter output. 


Barrel Input Select (BIS) 


The Barrel Input Select bit is used to route two 16-bit words, 
one in the Barrel Input Latch (BIL) and the other in the Data 
Input Latch (DIL), to either the most significant or the least 
significant word position of the Barrel Shifter. When the BIS 
bit is set high, the DIL source serves as the most significant 
input word to the barrel shifter with BIL data going to the 
least significant word position. When the BIS bit is set low, 
the DIL source input is routed to the least significant posi- 
tion of the barrel shifter and BIL data goes to the most sig- 
nificant position. 


The criteria for setting the BIS bit can be determined by the 
direction of the BITBLT source data read operation. When 
transferring data words from memory to the BPU in a left-to- 
right sequence, the BIS bit should be set to zero so that the 
first source word fetched into the source pipe will be routed 
to the most significant word position of the barrel shifter 
while the second will go into the least significant position. In 
the case where data is being fetched in a right-to-left fash- 
ion, it becomes necessary to swap the two source words at 
the input to the barrel shifter so the bit order of the two 
words will remain contiguous. This is done by setting the 
BIS bit high. 

This bit is automatically set by the RGP during BITBLT oper- 
ations. 


FUNCTION SELECT REGISTER 


The Function Select Register (FSR) is a 4-bit register used 
to set one of 16 BITBLT logic operations to be performed 
between the source and a destination data blocks. The 
function select truth table can be found in Table Il. 


Functional Block Description (Continued) 
TABLE I. Left and Right Mask Truth Table 


OOnN Daf WN = O 


RM = 
RM = 
RM = 
RM = 
RM = 
RM = 
RM = 
RM = 
RM = 
RM = 
RM = 10 
RM = 11 
RM = 12 
RM = 13 
RM = 14 
RM = 15 


OAOAnN Oath OD — 


L_mask = 
L_umask = 
L_emask = 
L__mask = 
L_mask = 
L_emask = 
L_mask = 
L_wmask = 
L__mask = 
L_mask = 
L__mask = 
L_mask = 
L_mask = 
L_mask = 
L_omask = 
L_umask = 


R__mask = 
R__mask = 
R__mask = 
R__mask = 
R__mask = 
R__mask = 
R__mask = 
R__mask = 
R__mask = 
R__mask = 
R__mask = 
R__mask = 
R__mask = 
R__mask = 
R__mask = 
R__mask = 


“1” = Enable Logic Op 
“0” = Disable Logic Op 


11111111 
11111111 
11111111 
11111111 
114111111 
01111111 
00111111 
00011111 
00001111 
00000111 
00000011 
00000001 
00000000 
00000000 
00000000 
00000000 


00000000 
00000000 
00000000 
00000000 
10000000 
11000000 
11100000 
11110000 
11111000 
11111100 
11111110 
11111111 
11111111 
11111111 
11411111 
11111111 
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The 16 BITBLT functions can be expressed as: 
{3 *(—s* —d) + 
f2*(—s*d) + 
f1*(s* —d) + 
fO * (s * d) 
where f3-—fO are the bits of the Function Select Registers, ‘s’ 
is the source data and ‘d’ is the destination data. 


The FSR is loaded by asserting the Function Select Enable 
(FSE) pin low while valid data is made available at the data 
port (D0-D15). Data is latched into the register on the rising 
edge of FSE. 

D3 DO 


f8 | f2 1 fii fd 
FS 


FIGURE 5. Function Select Register (FSR) 


The contents of the FSR are defined by the user prior to 
BPU operations. In multi-plane systems that use a BPU per 
plane, each BPU can have its logic function programmed 
independently. This feature allows the RGP to BITBLT data 
to all bitplanes concurrently while maintaining the flexibility 
of having unique logical operations being performed on 
each plane. 
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Biock Diagram 
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Functional Block Description (continued) 
TABLE II. BITBIT Function Definitions 


Operation 
f0 Performed 


0 

sandd 
sand —d 
Ss 

—sandd 
d 

s xord 
sord 
—sand —d 
s xnord 


Function Select Bits 
f3 f2 f1 


“oa oi st Oo UCU CU LC HE CUO OO OO OO OO 0 


0 
0 
0 
0 
, 
1 
1 
1 
0 
0 
0 
0 
1 
1 
1 
1 


=3 00-242 00423200242 00 
=-“O--O--7r Or ors O-0O=-0+0 


Note: 
d: destination data 
$s: source data 
f3-f0: function select code, which selects one of the 16 
BITBLT functions. 


DATA INPUT LATCH 


The Data Input Latch (DIL) consists of a master latch, DIL- 
Master, and two slave latches, DiL-Source and DIL-Dest 
(see Figure 6). 


Data is latched into the DIL-Master latch on the falling edge 
of PH2 when DLE is asserted low. Data is then transferred 
into either the DiL-Source slave latch or the DIL-Destination 
slave latch on the next rising edge of PH1 depending on the 
state of the BSE input. Data is then latched when PH1 re- 
turns low. 


Each of the slave latches are used to direct the flow of data 
to either the source or destination pipe. When performing 
BITBLT, data from the source area is read and latched into 
the source pipe via the DiL-Source latch, aligned and stored 
in the FIFO. Once the source data operation is complete, 
data from the destination area is then directed into the des- 
tination pipe via the DIL-Dest latch to be combined, word- 
by-word, with the data stored in the source pipe FIFO. 


DO=15 


PH2 


DLE 


To Barrel input Latch 
& 


Barrel Input Mux 


DO D15 


DIL_Master 


BARREL INPUT LATCH 


The Barrel Input Latch (BIL) serves as a pipeline register for 
source data (see Figure 7) during a BITBLT source read 
cycle. Data, which is transferred into the DIL-Source register 
on the rising edge of PH1, is subsequently transferred into 
the BIL on the next rising edge of PH1. As a result, if two 
words are fetched and latched into the BPU on two subse- 
quent clock cycles (assuming BSE = 1), the first data word 
will be in the BIL and the second in the DIL-Source Latch. 
The BIL and DiL-Source outputs are then concatenated via 
a crossbar multiplexer to form the 32-bit input to the Barrel 
Shifter. 


It is important to note that valid data will remain in the BIL 
for only ONE clock cycle as the next PH1 cycle will again 
transfer data from the DiL-Source Latch to the BIL. This 
condition implies that the result from the Barrel Shifter must 
be latched into the FIFO on the rising edge of PH1 immedi- 
ately following the PH1 transition used to latch the second 
word into the DiL-Source latch. 


From DiL_Source 
Register 


16 


BIL1S SPO SP15 


Barrel Input 
Crossbar 
Multiplexer 


BIS 


Sn 


CO=C15 
To FIFO Input 
MUX 


TL/F/9337-23 
FIGURE 7. Barrel Shifter/Barrel Input 
Latch Block Diagram 


To Logic Unit 
& 


Pixel Multiplexer 
TL/F/9337-24 


FIGURE 6. Data Input Latch Block Diagram 
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Functional Block pial tea (Continued) 


BARREL SHIFTER 


The function of the Barrel Shifter is to align the BITBLT 
source data to the destination data. Bit netigoment may cross 
word boundaries. 


The Barrel Shifter is implemented as a 32-to-16-bit multi- 
plexer (see Figure 7). Depending on the type and length of 
a BITBLT operation the necessary source data words are 
fetched from memory into the BPU to form a 32-bit input to 
the Barrel Shifter. The Barrel input Latch (BIL) stores the 
first source data word fetched and the Source Data Input 
Latch (DiL-Source) stores the subsequent source data 
word. 


A multiplexer precedes the Barrel Shifter to facilitate swap- 
ping the input words if necessary. This word swapping is 
controlled by the state of the BIS bit in the Control Register 
and does not affect the ordering of the data bits of the input 
words. This swap mechanism facilitates the fetching of 
BITBLT data from left-to-right or from right-to-left. 


FIFO 

The FIFO, 16 bits by 16 words, constitutes the BPU’s on 
chip storage (see Figure 8). Implemented as a dual port 
register file, the FIFO has separate READ/WRITE control 
inputs, FIFO Read (FRD) and FIFO Write (FWR). The regis- 
ters are selected via two shift register ring counters which 


are incremented via the FRD and FWR inputs. These coun- 
ters are set to. zero with the RESET pin. 


The FIFO facilitates the use of fast localized memory ac- 
cess modes such as page and static column modes by pro- 
viding storage for multiple words of barrel shifted words. 


BITBLT LOGIC CONTROL UNIT 


The BITBLT Logic Control Unit (LCU) is responsible for the 
mask and bitwise logical operations performed on BITBLT 
and line drawing data (see Figure 9). 


in the BITBLT mode (L/B = 0), the LCU performs a bitwise 
logical operation between a source data word read from the 
FIFO and the Destination Data Input Latch contents. The 
logic operation performed is specified in the FS Register. 
The LCU is also responsible for masking the appropriate 
destination data bits from change during the logic operation. 
The masking function is determined by the values set in the 
LM and RM fields of the Control Register and the state of 
both LME and RME control inputs. 


During Linedrawing (L/B = 1), the LCU performs a bitwise 
logical operation between the contents of the Pixel Input 
Latch and a single bit of the Destination Data Input Latch 
contents. This bit is selected by the state of Pixel Address 
bits BO-B3. The logical operation function is determined by 
the FS Register contents. 


From Barrel From BITBLT 
Shifter Logic Unit 
16 16 
co C15 — BLO BL15 
FIFO INPUT 
BSE, 
16 
FINO 7 FIN1S 
FIFO ele 
16 BITS BY 16 : B3_FRD, 
WORDS RESET 
From BITBLT 
To , Logic Unit 
BITBLT Logic 16 16 
Unit Input. 
MUX 
DATA 
DOS OUTPUT MUX 
16 


To Data Buffers. 
FIGURE 8. FIFO Block Diagram 
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Functional Block Description (continued) 


From FIFO From Pixel Input 
Block Latch 


L/B 


From 
DiL_Destination 
Latch 


BO_LME; 


B1_RME, a 

B2_FWR, RM 

B3_FRD, _ 
L/a 


To Data Output MUX 
and FIFO Input MUX 


FIGURE 9. BITBLT Logic Unit Block Diagram 


TABLE Ill. Barrel Shifter Truth Table* 


0 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 


as 
awh 


wk Sk > ek 
OB © NY 


*| = shift number 

A0-A15 = barrel shifter most significant word 
BO-B15 = barrel shifter least significant word 
CO-C15 = barrel shifter output 
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Functional Block Description 
(Continued) 
PIXEL PORT 


The Pixel Port consists of a single-bit !/O port dedicated to 
pixel level operations. It consists of a Pixel Input Latch anda 


Pixel Output Latch (see Figure 10). Data is written to the - 


Pixel Input Latch by asserting PDLE. Data is read from the 
Pixel Output Latch by asserting POE. 


The Pixel Input Latch (PIL) is used to store a pixel value 
which is used as the source operand during linedrawing 
Read-Modify-Write cycles. Internally this bit is replicated 
into a 16-bit word and routed to the BITBLT Logic Unit 
where it is masked and logically combined with a word read 
from the destination space. The Pixel Address inputs 
(BO-3) determine which bit of the destination word will be 
combined with the pixel port data. The PIL must be loaded 
prior to performing any linedrawing or pointwise drawing op- 
erations and L/B must be set high for proper operation. 

The Pixel Output Latch (POL) is used to read a single-bit 
pixel from the frame buffer. This is accomplished by loading 
the appropriate destination word to the DIL-Dest latch (BSE 


must be asserted low while using this function). A single bit © 


of this word is selected based on the value of the Pixel 
Address inputs (BO—3) and routed to the Pixel Output Latch. 
This bit is unmodified when read from the frame buffer. _ 


Operational Description 


The BPU’s primary function is to perform two types of data 
manipulation tasks: 1) execute shift, mask, and logic opera- 
tions between source and destination data blocks (BITBLT 
operation); 2) logically combine line drawing data with desti- 
nation data (Line Drawing operations). 

These operations are carried out in the BPU via 3 major 
functional blocks: The Barrel Shifter, the FIFO, and the 
BITBLT logic Unit. These blocks in combination with a vari- 
ety of latches and multiplexers synchronize, modify and 
route the data through and around the device. 


PH1 PDLE 


DLE, 


Dataflow through the BPU is dictated by control parameters 
previously set up in the Control and Function Select Regis- 
ters and by the state of various control inputs on the device. 


BITBLT OPERATION 


The mechanics of a BITBLT operation are relatively straight- 
forward. Following Figure 17 and Figure 12, a source data 
word is fetched from memory and latched into the Data In- 
put Latch (DIL-Source) via the DIL-Master latch. A second 
source data word is subsequently fetched and latched into 
the DiL-Source latch while the first source data word is 
simultaneously transferred to the Barrel Input Latch. This 
makes both the first and second source data words avail- 
able to the Barrel Shifter as a 32-bit data word. This 32-bit 
quantity is routed through a crossbar mux and into the Bar- 
rel Shifter which produces an aligned 16-bit source word. 
This data is latched into the FIFO on the next clock cycle. 


The data is then read from the FIFO on a subsequent clock 
cycle and routed to the BITBLT Logic Unit where it is 
masked and logically combined with a destination data word 
previously fetched and latched into the Dil-Destination 
latch. The data is then routed back to the Data Port (DO- 
D15) so it can be written back to the destination data mem- 


ory. 


LINE DRAWING OPERATION 


During a line drawing operation (see Figures 13 and 74), the 
‘source’ data, referred to as the ‘Pixel Input Data’, actually 
comes from the Pixel Input Latch (PIL). The PIL input data is 
latched into the BPU’s PIL prior to the line-drawing opera- 
tion. The destination data is then fetched from memory and 


_ written to the DiL-Destination latch (BSE must be asserted 


low). The pixel input data bit is internally replicated into all 
16-bit positions of a data word and routed to the BITBLT 
Logic Unit where it is logically combined with the value in 
the DiL-Destination. Since only one bit of the destination 
word is to be modified with pixel data, a mask is 


From DiL=Dest. 
Latch | 


B[3:0] 
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FIGURE 10. Pixel Port Block Diagram 
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Operational Description (Continue) 


generated based on the state of the pixel address lines 
(BO—B3) so that all other bits of the destination word are 
protected from change. The resulting data word is then rout- 
ed back to the Data Port (DO0-D15). 


In multiple-plane color display systems that utilize a BPU per 
bitplane, the Pixel Input Latch is commonly used to deter- 
mine color value during linedrawing. This is accomplished 
by loading the color value across the depth of the planes. 
This technique is also useful for rapidly downloading image 
data that is stored in a word-per-pixel format into a frame 
buffer utilizing a planar architecture. 


The Pixel Output Latch reflects a single pixel data value 
derived from a 16-bit word previously loaded to the DIL-Des- 
tination latch. The Output Pixel Data value is one-of-16 bits 
of the destination data word selected by the pixel address 
lines BO—B3 and is purely unmodified destination data. This 
function is useful for reading pixel color data in multiple- 
plane systems that support image processing capabilities. 
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TABLE IV. DP8500 RGP Drawing Instructions 
that utilize the DP8511 BPU 


MOV s,FSE Load Function Select Register 
(FSE = Address of FSE Register) 
MOV s,PIL Load Pixel Input Latch 
(PIL = Address of Pixel |/O Port) 
MOV POL,d Read Pixel Output Latch 


(POL = Address of Pixel !/O Port) 


RDPT Read Point 
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Operational Description (Continued) 
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FIGURE 11. Dataflow during a Typical BITBLT Operation. (RGP Instructions: BT, FILLA, DC.) 
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Operational Description (Continued) 


DO=15 PDn 
ZY 1 
peenieees > lesa wy 
jane souWes< La sae PIL | 
FS-ptesey pt cet te pL ener] : 
i i ce ( (III, 16 
Fs ' BIS Sn Lm Rm pevescuceus = 
CRE-e eon ‘ est, aura] DiL_Source ‘ a ae oper | = 
Uy = 
ear oe WH = 
DLE A faa ? j = 
| Eevasqnese? = 
woes ’ Y = 
BIS--=-p} = MUX : Yj | = 
tewpoceopad Yj = 
a; 4 WH = 
aes See Om Z = 
BARREL = Shm=sp j = 
seelsesers Yj = 
Y = 
7 + aes aaa y eee eee ‘ = 
ere Se y : = 
ae BSE--->) MK j ! = 
iE -- 3 tea t= POLE; H Yj | = 
paeee tosses Y) = 
BSE == <4 procs a PBSE, RESET === pF y = 
pee. wee B2FWR----pt FIFO Yj = 
B3_FRD,--- >} ' g = 
Y E 


Saeweqesae 


9 
§ 
‘ 
$ 
’ 

t 
eo 

’ 

J 

5 

Q 

Q 
fi 


PIXEL INPUT DATA 


Pe VT ew 2 @ 2 @ BZ @e2wowoeneoweoe 
3 
ebeewewuaeeaeewaeeeneeenwees © @ @ 


BO_LME, : 
LLL, DESTINATION DATA B1_RME, Ay am 
B2_FWR, 7 
Rees PIXEL OUTPUT DATA B3_FRD, Fs 
quENENER PIXEL OP DESTINATION L/B ---- 
' 16 
wre meme ene , te f= 
BOE \/ POE \/ 
DO=15 PDn 


TL/F/9337-4 
FIGURE 13. Dataflow during a Typical Line Drawing Operation. 
(RGP Instructions: RDPT, DRPT, DRLN, DRLNS, DRPGN, DRPLN.) 
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Operational Description (Continued) 
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FIGURE 15. Pipelined BITBLT Example for a Transfer of a Single-Width, Non-Aligned Horizontal Line of at Least 16 Words in Length 


Systems Applications 


A typical system example illustrating the BPU’s interface to 
the RGP and system elements is shown in Figure 76. Includ- 
ed in the example is National’s DP8500 Raster Graphics 
Processor, the DP8520 Video Ram Controller/Driver and 
the DP8511 BPU. A user defined state machine is used to 
control the dataflow between the RGP, BPU and Frame 
Buffer. 


Since the BPU and RGP are members of an integrated fami- 
ly of devices, they have been designed to minimize the need 
for external glue logic between them. The user is left the 
task of defining the interface from the RGP/BPU to the 
frame buffer, tailoring it to the system needs. 

STATE MACHINE TASKS 


Referring to Figure 16, the state machine performs the fol- 
lowing tasks: 


1. Determines the start of an RGP memory cycle. 
2. Determines the type of access to be performed. 


3. Generates the necessary timing required to carry out the 
access including wait state generation. 


. Generates refresh timing signals. 
. Generates BPU Control Register strobe (CRE). 
. Generates BPU Function Select Register strobe (FSE). 


. Generates the DOE, DLE, POE, and PDLE strobes to the 
BPU. 


The memory address of the Control Register and Function 
Select Register are user defined locations in the RGP mem- 
ory space. It is therefore necessary to generate the CRE 


NOOO fb 


RGP ADDRESS BUS 


STATE 
MACHINE 


and FSE strobes in order to select these registers. The DOE 
and DLE strobes must also be generated by the user since 
the timing characteristics of these signals are determined by 
the memory access timing requirements. 


MULTIPLE BITPLANE ARCHITECTURE 


The Advanced Graphics Chip Set (AGCS) architectural rela- 
tionship, in particular that of the RGP/BPU devices, is such 
that multiple plane memory systems can be developed with 
relative ease and flexibility. The purpose of segregating the 
BITBLT and Line Drawing functions into a separate device 
(the BPU) becomes evident when studing the implementa- 
tion of a multi-plane display system. Figure 77 illustrates an 
approach to designing a multi-plane system. 


In any BITBLT based display system, it is necessary to be 
able to move data rapidly between two areas of memory 
without restrictions imposed by memory organization. In 
multiplane systems it is necessary to allow for data opera- 
tions to occur between planes of memory as well as locally 
within the bit-plane. The ability is governed by the logic in 
the video plane control block. 


lf a BPU is incorporated in each bit-plane of memory and 
the plane controller is designed correctly, data transfers can 
be performed in parallel across n-planes of memory, be- 
tween specific bitplanes, as well as locally on each bit- 
plane. As an added bonus, this design approach allows the 
system to be expanded to any practical number of bit- 
planes while maintaining the performance and throughput 
characteristics of a single plane system. 
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SYSTEM DATA BUS 


TL/F/9337-5 


FIGURE 16. DP8500 System Interface 
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FIGURE 17. Typical RGP-Based Multiple Bitplane System 
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TL/F/9337-31 
a)Simple Microprocessor Interface Utilizing Software for Address Generation 
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TL/F/9337-41 
b)Microprocessor Interface with Dedicated Hardware Controller for BITBLT and Linedrawing Operations 


FIGURE 18. Design Examples for Microprocessor-Based System 
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BPU APPLICATIONS UTILIZING A CONVENTIONAL MI- 
CROPROCESSOR ae 


If desired, the DP8511 can be used in conjunction with a 
conventional microprocessor. In this application, the BPU is 
operated under the control of the microprocessor and a 
small dedicated timing state controller which produces the 
basic memory and BPU timing control. The microprocessor 
would be responsible for interpreting the BITBLT or line- 
drawing setup parameters and generating the appropriate 
address and timing contro! while the BPU would serve as 
the BITBLT or linedrawing data processor. Implementation 
can vary as to which level of hardware integration is chosen 
versus doing the drawing and BITBLT functions in software. 
This partitioning is ultimately based on the performance lev- 
el targeted for the application. 


Two examples of this partitioning is illustrated in Figure 78. 
In Figure 18a the drawing engine consists of a microproces- 
sor, a small memory/BPU timing generator, the frame buffer 
and the BPU. In this application, the microprocessor is re- 
sponsible for calculating the address, mask, and other relat- 
ed setup parameters as well as generating the addresses 
and mask enables used for the memory operations between 
the frame buffer and the BPU. The timing controller would 
decode the microprocessor’s memory read/write opera- 
tions and generate the appropriate memory and BPU con- 
trol timing. As suspected, the drawing performance of this 
solution would be based on the microprocessor’s |/O per- 
formance. 


Another approach to this design (see Figure 18b )would be 
similar in concept except would partition the functionality so 
the address and mask enable generation would be part of 
the memory/BPU timing generation hardware. In this case, 
the microprocessor would caiculate the setup information 
only and transfer it to a dedicated hardware address con- 
troller which would perform the drawing memory cycies in 
conjunction with the memory/BPU timing controller. In this 
way the hardware would be able to take advantage of per- 
formance features like page mode memory access and in- 
terleaved setup and drawing execution. 


DETERMINING BITBLT PARAMETERS 


Before a BITBLT operation can take place, it is necessary to 
calculate a number of parameters that will be used to con- 
trol the BPU’s activity. These parameters include: 


1) BIS value, 


_ 2) The Shift Number (SN), 


3) The Left Mask value (LM), 
4) The Right Mask value (RM). 


These values can be calculated by using the following pa- 
rameters normally specified prior to starting the transfer cy- 
cle. 

These are: 


1) BITBLT Source Starting Address (BSS), 

2) BITBLT Destination Starting Address (BDS), 

3) BITBLT Width (BLW), 

4) Source Warp (vertical word address offset), 

5) Destination Warp (vertical word address offset), 
6) and the BITBLT direction. 


Many of these parameters are used to calculate the inner 
and outer loop values for the BITBLT control logic. These 
are, for example, used as comparator values for the hori- 
zontal and vertical counters that control the BITBLT opera- 
tion. In addition, these parameters are used to calculate the 
Control Register values for the BPU. 


When using the BPU in conjunction with the DP8500 Raster 
Graphics Processor, all of the setup calculation, Control 
Register loading, and BITBLT control logic used to generate 
the drawing cycles remains transparent to the user. 


When using the BPU with a microprocessor or hardware 
controller, it will be necessary to calculate all variables 
which are used to program the hardware for appropriate 
operation. The following sections will help in determining 
these values. 


- ASimpie Inner/Outer Loop Example 


lf a dedicated drawing machine is implemented, the logic | 


necessary would most likely be divided into two parts: a 


BITBLT controller and a linedrawing controller. Each of © 


these would be responsible for generating the address and 
control for the memory cycles to be performed. The logic for 
a BITBLT controller is relatively easy to design since it con- 
sists mostly of counters and comparators that control the 
inner and outer loop for the block transfer. A linedrawing 
controller, on the other hand, involves the design of a dedi- 
cated machine based on a linedrawing algorithm (i.e., Bres- 
senham’s linedrawing algorithm). Since this functionality is 
easier to realize using a software solution, a hybrid solution 
could be implemented which would assign any linedrawing 
tasks to the microprocessor and the BITBLT tasks to a 
hardware controller. 
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In anon-RGP based system, a mechanism must be defined 
that is responsible for generating the timing and control for 
the BITBLT operation. This mechanism may be as simple as 
a microprocessor functioning in tandem with a small state 
machine that generates the BPU and memory control sig- 
nals. The microprocessor would be responsible for generat- 
ing the address information. 


A more elaborate solution could be a dedicated state ma- 
chine that generates the BPU and memory control signals in 
addition to performing the address generation. 


In either case, some intelligent processor will be required to 
calculate the BPU’s control parameters. Determining the 
level of hardware sophistication for the BITBLT controller 
versus doing some portion of the control with a microproc- 
essor is solely dependent on the targeted performance lev- 
el. 


Below is an example of a simple inner/outer loop that de- 
scribes how a BITBLT controller might be implemented. The 
Starting Addresses and width values are specified in pixel 
values while the warps are specified in word values. 


System Applications (Continued) 


/* AN INNER/OUTER BITBLT LOOP EXAMPLE * 
/* to show how a BITBLT is performed. */ 
/* */ 
Static int 
BSS, /*BITBLT Source Starting Bit Address */ 
BSW, /*BITBLT Source Word Address (BSS/16) */ 
csc, /*current Source address */ 
BDS, /*BITBLT Destination Start Bit Address */ 
BDW, /*BITBLT Dest. Word Address *(BDS/16) */ 
cdc, /*current Source address */ 
BLW, /*BITBLT Width -l */ 
BW, /*BITBLT Width in words */ 
BLH, /*BITBLT Height -1l */ 
we, /*down counter for horiz. width (in words) */ 
vc, /*down counter for vert. height (in words) */ 
Start, /*flag that starts the BITBLT operation */ 
Stop_bitblt, /*This concludes the whole thing */ 
BSWRP, /*BITBLT Source Warp */ 
BDWRP, /*BITBLT Destination Warp */ 
ldst, /*horiz end-of=-line flag */ 
fdst, /*horiz start-of-line flag */ 
rme, /*right mask enable */ 
lme, /*left mask enable */ 
dir, /*Direction of BITBLT */ 
/*O0 = left to right, 1 = right to left */ 
Eob; /*end of horizontal line flag */ 
main() 
{ 
/* Calculate BITBLT parameters such as: 
Height 
Width 


Shift Number 
Left and Right Masks 
Direction of BITBLT 
etc. 
and download to BITBLT control logic. 
i 


[78 EE He He He ee ee ee ee a a ee ae ae ae ae a ae a ok 
/* BITBLT function routines * / 
fH He ee ee ee eee ae ea ee ae ae ae ae ae a 


do_bitblt_outer_loop() 
{ 


do_bitblt_outer_loop(); 


if (start) 

{/*Calc. the initial source starting word address */ 
BSW = BSS>>4; /* BSS/16 */ 
csc = BSW; 

/*Cale. the initial destination starting word address */ 
BDW = BDS>>4;3 /* BDS/16*/ 
cdc = BDW; 

/*Calculate the initial width of transfer value */ 
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BW = BLW >> 4; 
if (BUN & Oxf) 

BW = BW +13; 
j 


while (stop_bitblt == 0) 


/* do this until the height is finished */ 

if (start) 
vc = BLH; /* set initial vertical count */ 

fdst = 1; 
do_bitblt_inner_loop() ; | 

if (Eob)/*if at end of horiz. line then reset values */ 
{ 


ve = VC -l; 
Wc = BW; /*reset horizontal counter */ 


BSW = BSW + BSWEP; 
cdc = BSW; 
BDW = BDW + BDWRP; 
cdc = BDW; 
Eob = 0; 
ldst = 03 
} 
if (VC == 0) 


Stop_bitblt = Oxfffr; 


} 
do_bitblt_inner_loop() 
{ 


while (Eob == 0) 


/* BITBLT horizontal down-counter */ 
if (start) 


WC = BW;/*set initial horizontal cnt */ 
Start = 0; /*reset flag */ 
j 
if (WC == 1) 
ldst = 1; 
do_source_cycle() ; 
set_mask() ; 
do_dest_cycle(); 
fdst = 0; 
/* Keep track of horiz. count */ 
if (Eob == 0) 
WC = WC -1; 
/* Flag to indicate end-of=horizontal line */ 
if (WC == 0) 
Eob = 1; 
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do source _cycle() 


{ 
/* 


This routine is the hardware control logic 


that reads the source data from memory and 
Strobes it into the BPU and controls the transfer 


into the FIFO. 
*/ 


} 
do_dest_cycle() 
{ 


csc = cSc + 1; 


/* This routine is the hardware control logic that generates 
the appropriate address and control signals to read 
the data for the destination cycle. 

a 
ede = cde + 1; 

} 

set_mask () 

{ 

/* 


1) if the BLT is from right-to-left and the last word of the 
horizontal line is being transferred, then enable the left mask. 


2) if the BLT is from left-to-right 
horizontal line is being processed, 
3) if the BLT is from right-to-left 
horizontal line is being processed, 
4) if the BLT is from left-to-right 
horizontal line is being processed, 


*/ 
lme = (dir & ldst)|(~dir & fdst); 
lme = (dir & fdst)|(~dir & ldst); 


} /*end of inner-outer example */ 


Barrel Input Select (BIS) 


One item to determine is the BIS value. This is done by 
selecting the direction that the BITBLT is to take place in. 
Then the following procedure should be used to resolve the 
BIS value: 


Calculate _BIS_Value () 
{ 
Static short 


BIDR, /*BITBLT down and to the right */ 
BTDL, /*BITBLT down and to the left */ 
BTUR, /*BITBLT up and to the right */ 
BTUL, /*BITBLT up and to the left */ 
dir, /*direction data is read from memory 

/*O=left-to-right, l=right-to-left */ 
BIS, /*BIS value to be used */ 


set_bis,/*flag that forces BIS high */ 


LM, /*left mask value */ 
RM, /*right mask value */ 
SN, /*4-bit shift number */ 


/*when BITBLTing to=-the=-right... */ 
if (BTDR|BTUR) 

dir = 0; 
/*when BITBLTing to-the-left... */ 
if (BIDL|BTUL) 


dir =l; 


*/ 
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and the lst word of the 
then enable the left mask. 
and the 1st word of the 
then enable the right mask. 
and the last word of the 
then enable the right mask. 


/*if the shift number is zero then always 

set BIS to one. This is done to Save reading 

the source data twice */ 

if (SN = 0) 
Set_bis = 

BIS = dir|set_bis; 

} 


Shift Number (SN) 


The shift number is calculated by subtracting the 4 least-sig- 
nificant bits of the Destination Address from the 4 least-sig- 
nificant bits of the Source Address. 


Calculate_Shift_Number() 
{ 
Static short 


is 


oh 


BSS, /*Source Starting-Address */ 
BDS, /*Destination Starting Address */ 
SN, /*Resultant Shift Number */ 


SN = (BSS - BDS) 
j 


& Oxf; 
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Left and Right Masks (LM, RM) 


The masks are calculated by using the destination pixel ad- 
dress in conjunction with the BITBLT Width value and the 
direction of the BITBLT. Generally speaking, the masks are 


enabled only when operating on either of the two words that . 


bound a scanline of BITBLT data. For example, when 
BITBLTing from right-to-left, the right mask is enabled by 
detecting the start of a new BITBLT destination scanline 
while the left mask is enabled when the leftmost word of the 
BITBLT destination scanline is being processed. 


Calculate_Masks () 
{ 


static short 


RM, /*Right Mask Value */ 
LM, /*Left Mask Value 
BDS, /*BITBLT Destination Starting Addr */ 
BLW, /*BITBLT Width ~— 1 */ 
if (dir = 0) /* left-to-right BITBLT */ 
{LM = BDS & Oxf; 
RM = (BDS + (BLW-1)) & Oxf; 
} 
if (dir = 1) /* right-to-left BITBLT */ 


{LM = (BDS — (BLW-1)) & Oxf; 

RM = (~BDS & Oxf) + BLW; 

j 
j | | 
If additional masking is being generated, for example, a clip- 
ping mask, then this must be taken into consideration when 
establishing the boundary mask values since the clipping 
mask may affect the boundary masks values. 


Additional Considerations 


Another item that must be determined in order to achieve 
the maximum BITBLT performance is whether or not two 
source read cycles are necessary during the source opera- 
tion to align the source area to the destination area. This 
decision can be based on the following criteria: 


1) If the pixel starting address of the source is less than the 
pixel starting address of the destination and the BITBLT 
direction is from right-to-left, then it will be necessary to 
read two adjacent source words during the source read 
cycle. This is because the alignment process requires a 
portion of the adjacent word in order to makeup a full 16- 
bit value for the destination RMW cycle. 
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2) If the pixel starting address of the source is greater than 
or equal to the pixel address of the destination and the 
BITBLT direction is from left-to-right and the shift number 
is greater than zero, then it will be necessary to read two 
adjacent source words during the source read cycle. 
Again, this is because the alignment process requires a 
portion of the adjacent word to complete the alignment 
to the destination. 


Upon evaluation of the above statements, it is safe to de- 
duct that when shifting the source from right-to-left, two 
source words must be read in order to align the source to 
the destination properly. Conversely, when shifting the data 
from left-to-right, one source word must be latched into both 
word positions of the Barrel Shifter so that an effective ro- 
tate right can be executed. This can be done by allowing an 
additional clock cycle to occur after latching the data into 
the Data Input Latch which will transfer a copy of the DIL’s 
contents to the Barrel Input Latch. The shift value in this 
later case is (BSS — BDS) & Oxf. 


BITBLT APPLICATIONS 


The following section gives a detailed analysis of how a 
BITBLT operation is executed using the DP8511. The objec- 
tive is to copy a 20-bit wide single height source block from 
pixel address 0002h to a non-aligned destination pixel ad- 
dress of 0805h using an “AND” function code and in a left- 
to-right BITBLT direction. Since both the source and the 
destination data areas are not aligned to a word boundary, it 
becomes necessary to shift the source data so its left-most 
boundary is aligned to the bit address of the targeted desti- 
nation area. Also, since the width of the BITBLT block is 20 
bits, the operation will involve two horizontally adjacent 
words. Because both words are considered to be boundary 
words it will be necessary to perform Read-Modify-Write cy- 
cles for each word during the destination portion of the op- 
eration. 


Calculating how to align the source to the destination area is 
essential to determining the shift, left, and right mask val- 
ues. See the section, Determining BITBLT Parameters, for 
more information on how to calculate these parameters. 
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Word Address 0 Word Address 1 
Pixel Address—>|0 12 3 45 678 9A8BCODE FIO 12345 6789<ABC ODO CE F 


VLOVOPETOLOTTTELOLODITEEPLTEESOTTOTOOBETEOLOLETTEOTEOIELETELOLESELILOLISISSOLEEOLEDIGLESOOLTLEEOTIIEDOEOLILIOLEIIPE, 


SELIIPPLOOTEL IF, 
ehhh LOLOL LITETELOLELPOOPLOLOEPEPLETIPEEES 


keh h hhh dhe bh hd ddA AAA LAA heh hhh ddddddddbdddddddtddditddditditd 


a 20 Bit Wide Source aig ee 
= Word Address 80h Word Address 81h 
01235456789 ABCODEFIO123456789ABCODEFPF 


SELOOLPLELOLELIOLEOLSSSES. 
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CLOLULEIEEODLDPDODERILEPLPLOLELEOLEPIIPPPILPOLEOLEOOESESSISOPEPSSEDILLEPLELLILEPIEEELS ESP EPEEEEOLEOLOLEDDEEBEDEOLDPPLETELIOLLED 


pcs Sat Destination Space a eee 


TL/F/9337-33 
Procedure: 
1) Load Function Code to FS Register, value equal to 1 (“AND” opcode). 


2) The source data must be rotated 3 positions to the right so it will be aligned to the destination space specified. Since the BPU 
cannot do a right shift the data must be shifted 13 positions to the left to achieve this effect. In order to rotate the 1st source 
word, it must be present in both the Barrel Input Latch (BIL) as well as the Data Input Latch (DIL). This is achieved by allowing 
the data to “‘sit’”’ in the DIL for an additional clock cycle. This will allow the data to transfer into the BIL on the clock cycle 
following the one used to latch the data into the DIL. The shift number must be set to 13 (SN= 13). 


3) The BIS value is also set at this time. Its value is chosen to be 0 since data is being fetched in a left-to-right sequence and we 
don’t want to swap the data going to the Barrel Shifter. 


4) The first data word is now latched into the Data Input Latch and one clock cycle later falls into the Barrel Input Latch. This will 
present a 32-bit wide data word to the Barrel Shifter. 


Word Address 0h Word Address Oh 
0123 45 678 9A8B C DE F 23456789 A BCD CE F 


SUEIEIILLEPLTOETETEETELEOLEPELOSEETEOTELLLEOTLEODELEPOLISEEEEBEDEOELBEMLED OLE LEDELOEE ED 


EPL OLEPLOLEETELELETOEPIOLOPLEOS TEEPE, 


o 1 
ze 


ee hbbebdddbdihdideddddddddddddédtkdttdddddddiddddddisitdddidisiadisiiidsiiidaiiE¢idisiaiiaa 


 pbhdhehhbhdhhddbddddeddeddddbdthddidétidddbddttddditdddddddddddddddedddedtdddtddedtdddddddea 


1st Source Word Fetched ist Source Word Fetched 
and written to Data Input Latch falls into the BIL one clock cycle later 
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5) Data enters the Barrel Shifter and is shifted to the left 13 bits (effectively a right shift of 3). The aligned source is then latched 
into the FIFO on the next clock cycle by asserting FWR. 


Aligned oe esl 
0123 45 678 9 AB‘CO CE F 


1st Aligned Source word 
(Write to FIFO) 


This portion will eventually 
be combined with the ist 


destination word 
TL/F/9337-35 
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6) The next task is to get the portion of the source data that is to be combined with the 2nd destination word. This will again 
require the source to be rotated to the right 3 bits. This is effectively a left shift of 13 bits. The shift number must be set to 13. 
Actually, this value is the same from the first alignment process so it’s not necessary to set it again. 


7) The second source word (word address 1) is latched into the DIL and is concatenated with the 1st source word which has 
been sitting in the BIL. 


, Word Address Oh . 3 - Word Address 1h 3 
0123456789 ABCODEF a |041 23-45-6789 £2 BCD EF 


VEUTLIILOMIETOLELOIEEELOTTETEDEOOOELPAIDTODELELEDEOELEEET TELE OIOOTAEMEOETEEEOEEO ETE 


ck heh hehadachachatacdahoaiechcadacdacachehedocdh dich chchechachechecdadecdacdchocdidhediddeeddedeedectedaedihedadckdbkceddhehddhddedechhdddhddde’ 


2nd Word Previously 2nd Source Word Fetched and 
Fetched Sitting in the BIL Written into Data Input Latch 


Poet hi ttiiitiihiitintiitittiidisisin 


Lhhidddiitititdii¢dh¢iiisititddddditéd 
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8) Data enters the Barrel Shifter and is shifted 13 places to the left which effectively performs a right shift of 3 positions. This 
aligns the second source word for combination with the second destination word. The aligned source is then latched into the 
FIFO on the next clock cycle by asserting FWR. 


ers Aligned Source 
0123 45 678 9 ABCD €E F 


2nd Aligned Source Word 
(Write to FIFO) 


This portion will eventually 
be combined with the 2nd 


destination word 
j TL/F/9337-—37 


9) Now that the source data has been aligned, the next step is to read the two destination words and combine the source data, 
which is in the FIFO, with them. This requires that the masks be set before doing the two read-modify-write operations. The 
first destination word has 5 bits on the leftmost side that needs to be masked from alteration. The second word has 7 bits on 
the rightmost side that need masking. The two mask values can be loaded to the RM and LM bits in the Control Register at 
one time and are enabled when needed by using LME and RME. LM is set to 5, RM is set to 8 (complement of 7). These 
values actually should get written to the Control Register at the same time the SN values did to save the unnecessary write 
cycles. 


10) Set BSE = 0, LME = 1, RME = 0, and read the first destination word (Adr = = 80) into the Data Input Latch. The data will 
fall into the destination pipe and be routed to the Logic Unit. 


11) The first aligned source word is read from the FIFO by asserting FRD. This data is “AND’ed”’ with the destination data in the 
BITBLT Logic Unit and the result appears on the DO-D15 lines. 


1-112 


System Applications (Continuea) 


a 1st Aligned Source Word rea Word Address 80h 
012345 6789 ABCD CE F 0123456789 ABCODEF 


7777 Men Bee Be 
1st Aligned Source Word Left Mask 1st Destination Word, fetched 


read from FIFO Applied here and written into Data-Input Latch 
ae Logic Operation takes place ae 4 


between these data areas 


Word Address 80h 
012345 678 9 ABCODE F 


Result of logical "AND" operation 
on ist Destination Word (value at 
data port D0-15) 
TL/F/9337-38 
12) Set LME = 0, RME = 1, and read the second destination word (Adr = 81h) into the Data Input Latch. Again this data will 
be routed through the destination pipe to the Logic Unit. 
13) Read the second aligned source word from the FIFO. This data will be ‘““AND’ed”’ with the destination data in the BITBLT 
Logic Unit and the result appears on the DO—D15 lines. 


14) The transfer is complete. The destination space has been modified as shown below. 


2nd Aligned Source Word Word Address 8th 
012345 6789 ABC OD CE F 012345 678 9ABCODE F 


eS ILLLILIIL LIL SELL Ia TILT LILIES ET EIOOLOIOED EES, 


oe 7 


—hdhddiidhdkidhdilidididehehdhddddhidhdedd tgh 


2nd Destination Word 
fetched and written Right Mask 
into Data Input Latch Applied here 


2nd Aligned Source Word 
read from FIFO 


Logic Operation takes place: 
between these date areas 


Word Address 80h Word Address 8th 
012345 67898Ai#sB(: CO E€ FJ0O 12345 6789 AB COD €E F 


ene rr Destination Spane Se 


TL/F/9337-39 
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Absolute Maximum Ratings 


lf Military/Aerospace specified devices are required, 
please contact the National Semiconductor Sales 
Office/Distributors for availability and specifications. 


Temperature under Bias | 
Commercial | 
Military 

Storage Temperature Range 


All Input or Output Voltage 
with Respect to GND 


Power Dissipation @ 20 MHz 
ESD rating to be determined. 


—65°C to + 150°C 


—0.5V to +7V 
0.5W 


DC Electrical Characteristics 


0°C to + 70°C 
—55°C to + 125°C © 


Note: Absolute maximum ratings indicate limits beyond 
which permanent damage may occur. Continuous operation 
at these limits is not intended; operation should be limited to 
those conditions specified under DC Electrical Characteris- 
tics. ao ane 


Ta = 0°C to +70°C Commercial (Ta = —55°C to + 125°C Military), Voc = 5V +10%, GND = OV 


Symbol | ____——Characteristics, | Conditions| Min_| Typ | Max | Units 
Me ft ec to |v 
a a O'S SD 
Vou | MOSClockHigh | PHY PH2Pins Only, MOS | Vec-05| | Voctos| Vv 
Vor___| MOSClockLow CL PHY PH2PinsOnly,Mos| -o5 | | 03 | Vv 
Vout PHI,PH2PinsOny,mos | -08 | | 05 | V 
Vion | TTLGlockHigh | TTL Clock/PHIOMy | 2s | | | 
Vrer__|TTLGlocktow | TTL Clock/PHIOMy =| | S| oe | 
Vor lon = —3mA p24 | | |v 
Vo [| tt ma | Ts 
In| LeakageCurent | v= Vor | | | tt 
loz__| TRISTATE® Leakage | Vo = VocorGnD | | S| tt0 | 
Iccr_| QuiescentCurent | PHY PH2at2oMHz = ||| 12 | mA 
loco __| SupplyCurent | PHY PHZattookHz =| || toma 
lcca__| SupplyCurent | PHY PHZatzoMHz == | ||| mA 
Own fin at 1 MHz rr ee ee 
Thermal Resistance - Ceramic PGA Package fp 
OIA Junction to Ambient °C/W 
95c Junction to Case °C/W 


Note: All output test conditions are 50 pF plus one TTL load. 


5.0V 


TL/F/9337-7 
FIGURE 19. BPU Output Test Load Circuitry 


TL/F/9337-8 
FIGURE 20. BPU Output TRI-STATE Test Load Circuitry 
Note 1: C1 = 50 pF 


Ri = 6ka 
R2 = 1.3 ko 
R3 = 1.8 kN 


Note 2: Connect SW to +5V for tpLZ and tpZL measurements. 
Note 3: Connect SW to GND for tpHZ and tpZH measurements. 
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fmos__| | PH1,PH2MOSClock Frequency | TOS=0 || 20. MHz 
temosy |__| -PH1,PH2MOSClockPeriod S| TOS=0 | 5 | 
fu | | PHITTLCIock Frequency | ToS=1 |S Ma 
tom | | PHITTLOlockPeriod ss | TOS=1 | S| 
ton | 22 | PHiHighTime(TcS=0) | RESO%toNextrESO% | 19 | 
tom | 22 | PHtHighTime(TCS=1) | RE.SVtoNextFE1.SV | 25 | 
tom | 22 | PHiLowTime(ics=1) | FE1.SVtoNextREtSV | 25 | 
tone | 22 | PH2HighTime(TCS=0) | RESO%toNextFESO% | 19 | 
tar | 22 | SO%PHIREtoSOmPH2RE =| | | 
tro | 22 | SO%PHZREtOSOMPHIRE | | | 
tnovi | 22 | Non-OverlapTime | PH2-to-PH1 50% ee 
tnove__| 22 | Non-OverlapTime | PHItoPH250% = || 3 | 
ta | 23 | DataSwitchingTime | FromL/B,Dos1.sv | 43 | 
test | 24 | DLE,PDLESetupTime(Tcs=0) | BeforePH2FESsO% | 10 | 
test DLE, PDLESetupTime(TCS=1) | BeforePHtRETSV | 10 | | 
tna DLE,PDLEHoldTime(Tcs=0) | afterPH2FESo% | to | 
tintr | 24 | DLE,POLEHoldTime(Tcs=1) | AfterPHIRE15V | to | | 
tis | 24 | DLE,PDLEInvaid =| BeforePH2RESOM =| 5 | 
tasat__| 24 | Data/ControlSetup Time (TCS=0) | BeforePH2FESsO% | 8 | 
tanat__| 24 | Data/ControlHoldTime(TcS=0) | AfterPH2FESO% | 8 | 
tistr__| 24 | Data/Control Setup Time (TCS=1) | BeforePHIRE15V | 5 _| 

tune | 24 

totoot 24 | RESET Setup Time (TCS =0) 

totint 24 | RESET Hold Time (TCS=0) | AfterPH2FE50%  —s_| 


nh 


4 Before PH1 RE 1.5V 


> 
+ 
co) 
| 
v 
ok 
N 
Tl 
m 
16.) 
oO 
PS 


= 
oi © 


totistr RESET Setup Time (TCS= 1) 
totintr 24 RESET Hold Time (TCS =1) After PH1 RE 1.5V 
BSE Setup Time (TCS=0) Before PH2 FE 50% 
BSE Hold Time (TCS = 0) After PH2 FE 50% 


BSE Setup Time (TCS= 1) Before PH1 RE 1.5V 


tbses2f 24 
toseh2t 24 
tbsesir 24 


— 
oO 


| Before PHIRE1.5V 

| AfterPHIRE1.5V 

| BSESetupTime(TCS=0) | Before PH2FE50% 

| BSEHoldTime(TCS=0) | After PH2FE50% 

| BSESetupTime(TCS=1) | BeforePHIRE1SV | 

tosontr | 24 | BSEHoldTime(TCS=1) | AfterPHIRE1SV | 
tear | 25 | 50 | 
tipzu | 27 pete 
tapz,_| 27 | 25 
tapz | 27 ea 
tiiz | 27 ae 
tow oe a’ 
ts al 
th a 
tdatr 
tigtr | 28 
tvbses ca ae 
tw tte | tte _| 
tir ac | 
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DP8511 AC Electrical Characteristics* (Continued) 


Notes: | 

*Conditions , 

Commercial: Ta = 0°C to +70°C, Voc = 5V +10%, GND = OV 

Military: Ta = —55°C to + 125°C, Voc = 5V +10%, GND = OV 

All values are in nanoseconds unless otherwise specified. . 

**Military Specifications are preliminary. Please contact your National Semiconductor Sales Office or Distributor for availability and specifications. 
+To insure that valid source data is written to the FIFO, FWR must occur one PH1 cycle time after DLE. | 


tTTo ensure that valid data is read from the FIFO, there must be at least two PH1 periods between FIFO write and FIFO read cycles when the FIFO read and write 
counters are equal. Please consult additional criteria in applications section of this datasheet. 


Timing Diagrams 


DEFINITIONS | 7 
All the timing specifications given in this section refer to 50% of the leading or trailing edges of the appropriate clock phase and 
0.5V or 3.0V on the appropriate signal as illustrated in the following figures, unless stated otherwise. . 


- Veco = 0.5V 


0.5V 
tsightf 


PH1 


tsightr 
SIGNAL 


tsichat tsich2r 


PH2 
TL/F/9337-9 


Note: Data is measured at 1.5V to 50% points of PH1 and PH2. 
FIGURE 21. Timing Specification Standard 


tonite 
50% 1.5V] 


PHI 50% [1.5V*] 1.5V* 


tohiH® 


PH2 50% 


TL/F/9337-10 


*TTL Clock Specification 
. FIGURE 22. Two-Phase MOS/Single-Phase TTL Clock Timing Specification 


3.0V. 


L/B, DOS VALID INPUT 
0.5V 


Vou 
Voi 


DO=D15 
PDn 


TL/F/9337~11 


FIGURE 23. Mode Control Timing Specification 
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Vec70.5V 
PH1 50% 1.5V 50% 
0.5V 
Ver=0.5V 
PH2 50% si 
0.5V 
a 3.0V 
DLE, POLE 
0.5V 
aa} ——}+$ tt 
DO=D15, PDn cal anes 3,0V 
BO/LME, B1/RME 1.5V VALID DATA nee 
B2 /FWR, B3/FRD 
/ / eas aaa Pea 
thses2t -—— ‘pseh2f 3.0V 
BSE i), Ge ey 


totistr aay 


+ totis2¢ 


3.0V 


0.5V 
TL/F/9837-12 


RESET 50% 


PH1 


PH2 


PDn : VALID PIXEL DATA 


3.0V 
: | ) Ga baa) 
Do-D15 NATAL | 


0.5V 

/ Re BE) a 
ope 0.5V 
2 3.0V 

_ 0.5V 


TL/F/9337-13 
FIGURE 25. Pixel Read Timing Specification 
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3.0V 
0.5V 


DO=-D15 1.5V 


VALID DATA 


3.0V 
0.5V 
TL/F/9337-14 
FIGURE 26. CRE and FSE Timing Specification 
See cee : 3.0V 
DOE, POE 1.5V 1.5V7 ce 
Osv 
ee 
DO=D15, | Vou 
PDn 1.5V 
| tdpzH—> | | ‘apHz 
DO - Di 5, 
PDn at Vou 
tapzi | ae tupLzZ sg. 


TL/F/9337-15 


FIGURE 27. Output Enable Timing Specification 


Vo 7 0.5V 


iis ev Vec - 0.5V 
0.5V 
PH2 0.5V 
Von 
DO = 015 . - VALID DATA ‘ VALID DATA 
Vo. 
Fe ee 
B3/FRD 0.5V 
DOE, we 3.0V 
BSE 0.5V 


. 3.0V 
DOS / —— 
SS SS SS Ss Sy. 


FIGURE 28. FIFO/DIL__Dest to Valid Data Timing Specification 
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Timing Diagrams (Continueq) 


PH1 


D (15:0) ZZTIZINEXCEE hn a oe 


tybses 
FWR 


FRD 


TL/F/9337-~40 
FIGURE 29. BSE and FWR Timing Relationships 
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National 
Semiconductor | 


General Description 


The DP8512 is a clock generator intended for use in medi- 
um- to high-performance CRT graphics systems. The device 
simplifies timing and minimizes phase skew between the 
various signals involved in the transfer of DRAM (or 
VDRAM) data into a DAC for display on a CRT. The device 
generates several synchronous clocks from a single crystal 
resonator input using digital phase locked loop (PLL) tech- 
niques. These clock signals include a graphics processor 
Clock, a raster-scan pixel clock, and various gated TTL and 


| ECL clocks required to transfer data from VRAM to video 


shift registers. Circuitry is also provided which enables the 
user to phase lock his graphics system to an external video 
source. The DP8512 is optimized for single-board graphics 


| systems. The DP8513 is a similar device intended for multi- 


board synchronous clock generation. 


The graphics processor clocks (PHI1 and PHI2) a are non- 
overlapping two-phase clocks with MOS-compatible outputs 
capable of driving 100 pF loads at up to a 20 MHz rate. 


The raster-scan pixel clock (PCLK) and the parailel load 
clock (LCLK3) required for video shift register operation are 
generated from the crystal resonator source using two pro- 
grammable counters in a phase locked loop configuration. 
Video word widths from 4 to 64 in increments of 4 are ac- 
commodated by these counters. The ECL-compatible PCLK 
outputs are capable of operating at up to 225 MHz from 
either a positive or a negative supply voltage. 


Two gated TTL clock outputs (LCLK1 and LCLK2) are also 
provided to enable easy transfer of data from a VRAM serial 


DP8512 Video Clock Generator 


shift register directly into a video shift register or indirectly 
through a FIFO. 


The graphics system can be linked to an external video 
source by means of a second phase locked loop on the 
chip. The crystal resonator can be operated as a voltage 
controlled oscillator allowing adjustment of its frequency un- 
til the system’s horizontal frequency agrees with that of the 
external source. 


Features 

mw On-chip crystal oscillator and phase-locked-loop gener- 
ate synchronized system clock, PCLK, and LCLK 

= MOS-compatible single-phase or non-overlapping two- 
phase system clock output 

m@ 225 MHz ECL differential output pixel clock (PCLK) 

™ Gated and non-gated load clock (LCLK) outputs ease 
VRAM-to-VSR synchronization 

m Accommodates video word widths from 4 to 64 in in- 
crements of 4 

@ Timing Adjust pin (Tadj) ceuldse a fixed offset adjust- 
ment of PCLK to LCLK to ease system design 

m ECL circuitry can be referenced to positive or negative 
power supply 

m Enables horizontal synchronization to an external 
source 


Block Diag FalM (numbers in parentheses indicate pin numbers) 


PHI1 PHIZ. «EXT REF 
(27) (26) (19) 


XTLB (21) 2 PHASE 
XTLC (22) 0 VCXO | TciK GEN 


; 
$0-$20 
(25, 24, 23) CNTR 


MAIN 
LOOP 

SOUT Omen | PH DET 

(32) CHG PMP 


LCLKO C 


ENINI, 2,30 
(35, 40, 41) 


LCLK TIMING AND 
CONTROL 


4 4 


ENOUT1,2,3 3 LCLK1,2,3 3 
(36,38,1,44) (37,39, 43, 42) 


| SECONDARY LOOP 
PHASE DET 
CHARGE PUMP 


RGP HOR 


(18) 


© CPO2 (16) 
© OPAMP2 (17) 


© CPOt (14) 
P-. © OPAMP! (15) 


4 
OLO=L3 (13, 12, 20, 11) 


OVCO!1 (8) 
0 VC02 (9) 
\, F==O TADS (2) 
2 
PCLK, PCL 
(4,3) TL/F/8758-1_ 
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oO 
S 
ld 
> 


41 | ENIN3 
140 9 ENINZ 


= 
oO 
> 


VEE 


vCc01 138 § ENOUT2 
vco2 LCLK1 
vcs | ENOUTI 
L3 ENIN( 
ut 34] VEE 
LO 334 LCLKO 
CPt 32} SOUT 
OPAMP1 37] VITLO 


CPO2 | 130} GNDO 


OPAMP2 29} GND! 
BISERFRP Eo 
ee OS zEE 
a 
& 
TL/F/8758--2 
Order Number DP8512V 


See NS Package Number V44A 


Pin Descriptions 


1, 44——-ENOUT3, ENOUTS: Differential ECL video enable 
output synchronous to LCLKO and gated by ENIN3 (10k and 
100k ECL compatible). Outputs require conventional ECL 
502 terminations. 


2—TADJ: Timing Adjust pin allows the PCLK output tran- 
sition to be offset by 2.5 ns relative to the ECL LCLK3 and 
ENOUTS transitions to accommodate the setup and hold 
requirements of various shift registers more easily. A Logic 
Hl = VECLO and a Logic LOW = VEE. 


3, 4—PCLK, PCLK: Differential ECL pixel clock outputs 
driven by the VCO in the main loop (10k and 100k ECL 
compatible). Outputs require conventional ECL 502. termi- 
nations. 


5--VECL1: ECL output buffer positive power supply which 
can be operated from OV to + 5.7V relative to GND. Note 1. 


6—VECLO: ECL internal logic positive power supply which 
can be operated from OV to + 5.7V relative to GND. Note 1. 


7, 34—VEE: ECL negative power supply. Note 1. 


8, 9~VCO1, VCO2: External tank circuit connections for 
the Pierce VCO. See typical applications for typical wiring. 


10—VCS: No connection required. 


11, 12, 13, 20——L3, L1, LO, L2: Four bit word input used to 
select the L Counter modulus. Any modulus from 4 to 64 
may be selected in increments of 4. LO is the least signifi- 
cant bit. A Logic Hl = VECLO and a Logic LOW = VEE. Itis 
recommended that these inputs be bypassed to VECLO with 
.01 wF and the jumper lengths be minimized and not cross 
over any TTL traces as the logic thresholds are 
VECLO — 200 mV. 

14—-CPO1: Main loop charge pump output. Used in con- 
junction with OPAMP1 to form the external loop filter. 
15—OPAMP1: Op amp output of the main loop. This output 
is used to control the pixel clock frequency via the varactor 
diodes in the LC tank circuit. 


16—CPO2: Charge pump output of the secondary loop. 
Used in conjunction with OPAMP2 to form the external loop 
filter. 

17—OPAMP2: Op amp output of the secondary loop. This 
output is used to vary the crystal fequency (VCXOQ) in sys- 
tems where it is desired to lock to an external video source. 
18—-RGP HORIZ: TTL compatible secondary loop phase 
comparator input. This signal completes the feedback path 
from the VCXO by way of the DP8500 Raster Graphics 
Processor’s Horizontal Output. The phase detector #2 is 
negative edge triggered from this pin. 

19—-EXT REF: TTL compatible detector #2 reference in- 
put. This is the optional horizontal input for systems where it 
is desired to lock to an external video source. The phase 
detector #2 is negative edge triggered from this pin. 

21, 22—-XTLB, XTLC: External connections for the Pierce 
crystal oscillator. See typical applications for typical wiring. 
23, 24, 25—S2, $1, SO: TTL compatible three bit word that 
determines the S Counter modulus. SO is the least signifi- 
cant bit. 

26, 27—PHI2, PHI1: MOS compatible two-phase non-over- 
lapping clocks. The frequency of these clocks is that of the 
crystal frequency. If these outputs drive large capacitor 
loads, a 10 wF or larger capacitor is required directly across 
the VTTL1 and GND1 pins. 

28-—-VTTL1: TTL output buffer supply. Specified for 5V 
+10% operation. Note 1. 

29—GND1: TTL output buffer supply return. Note 1. 
30—-GNDO0: TTL internal logic power supply return. Note 1. 
31—VTTLO: TTL internal logic positive power supply. Speci- 
fied for 5V +10% operation. Note 1. 

32-—~SOUT: TTL compatible ungated output of the S coun- 
ter. It is also connected to one of the two inputs of the main 
loop phase comparator. — 

33-—LCLKO: TTL compatible free-running Load clock. This 
signal is also connected to an input of the main loop phase 
comparator. 

35—ENIN1: TTL compatible video enable input. A high on 
this input starts LCLK1 on the next positive transition of 
LCLKO. 

36—ENOUT1: TTL compatible video enable output syn- 
chronous to LCLKO and gated by ENIN1. 

37—LCLK1: TTL compatible load clock equivalent to 
LCLKO, but gated by ENIN1. 

38—ENOUT2: TTL compatible video enable output syn- 
chronous to LCLKO and gated on the third positive transition 
of LCLKO following a valid ENIN2 input. 

39-——LCLK2: TTL compatible load clock equivalent to 
LCLKO but gated by ENIN2. 

40—ENIN2: TTL compatible video enable input. A high on 
this input starts LCLK2 on the next positive transition of 
LCOLKO. 

41—ENIN3: TTL compatible video enable input. A high on 
this input starts LCLK3 on the next positive transition of 
LCLKO. 

42, 43—LCLK3, LCLK3: Differential ECL compatible load 
clock synchronous to LCLKO and gated by ENINS (10k and 
100k ECL compatible). Outputs require conventional ECL 
502 terminations. 


Note 1: Refer to the Typical Supply wiring diagrams for acceptable wiring of single and dual supply applications. 
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Absolute Maximum Ratings | 
ECL Signals 


- If Military/Aerospace specified devices are required, 
please contact the National Semiconductor Sales Output Current 
Office/Distributors for availability and specifications. Supplies 
Storage Temperature —65°C to + 150°C VEE to GND. 
TTL Signals | a VECL to GND 
Inputs | - oo VTTL to GND | 
ESD susceptibility (see note 5) 


Outputs 7V 


Recommended Operating Conditions (Notes 1, 2, 3, 4) 


N 
© 


symbol |_—Parameter =| Min’, =| Typ | Max 
VITLO,1toGND | __ TTL Power Supply a ee 
VECLO,1toVEE | _ECL Power Supply a ee es, a ae 
veetoGno= = [| CT 
Vin . High Level Input Voltage | 
TTL Inputs 
ViL | ; Low Level Input Voltage V 
TTL Inputs 
fey High Level Output Current 
TTL Outputs | 
High Level Output Current 
MOS Compatible Outputs 
lo. Low Level Output Current 
TTL Outputs 
Low Level Output Current 
MOS Compatible Outputs 
Fecux (Note 6) Pixel Clock Max Frequency ee eee 
FxT | Grystal OscillatorMax Frequency | | | 
Tsu1 Setup TimeENINItoLcLKO | 20 || 
Tsuz Setup TimeENINZtoLOLKO | 20 || 
Tsus Setup TimeENINStoLOLKO | 20 | tt | ts 
Tay HoldTimeLCLKOtoENINY =| | 8 
THe Hold Time LCLKO to ENIN2 a a 
THs Hold Time LCLKO to ENING ee ee 


Tambient Operating Temp Range 
Note 1: See Timing Waveforms for relevant signal edges (positive or negative) from which all setup and hold times measurements are made. 
Note 2: TTL inputs—ENIN1, 2, 3, RGP HORIZ, EXT REF, SO, Si, S2. 

Note 3: TTL outputs—SOUT, LCLKO, 1, 2, ENOUT 1, 2; MOS Outputs—PHI1, 2. 


—50 mA 


—7V to +0.5V 
—-0.5V to +7V 


—0.5V to +7V 
1000V 


Units 


MHz | 
MHz 


Note 4: Inputs LO, L1, L2, L3, Taps designed to be tied to VECL for high level or shorted to VEE (or left open) for low jevel. See input schematics. 


Note 5: Human body model; 120 pF thru 1.5 kf. 


Note 6: Fecik is the maximum frequency that the pixel clock output can be reliably “locked”. The VCO range should be controlled to avoid exceeding 235 MHz 


when the maximum control voltage correction is applied. 
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Symbol Conditions 


Vic Input Clamp VTTLO, 1 = 4.5V, lin = —18mMA 
Voltage 


VoH |OutputHigh |VTTLO,1 = 4.5V|TTLOutputs, lon = —400uA sid 2 
Voltage to 5.5V VITL— 

vITL= 
=113 
amreaats VECLO=1.2 


=~ 
“< 

TG 

c 
= 
- 
@ 


5 

RO 
I 
— 
nn 
< 


jo) 
a 
< 


ie) 


NO 


| 
© 
se] 
ro) 
3 
< 


Voltage 


lo. = +100 pA 


lo. = 20 mA 


VECLO, 1 = OV _|ECL Outputs, 509 Load to —2V 


VEE = —4.2V | OPAMP Output, Io. = —1.25 X Iopo Source 


I; Max High Level| VTTLO, 1 = 5.5V, TTL Inputs, Vin, = 7V 
Input Current 

iH High Level VTTLO, 1 = 5.5V, TTL Inputs, Vin = 2.7V 
Input Current 

ln = | Low Level VTTLO, 1 = 5.5V, TTL inputs, Viy = 0.4V | 
Input Current , 


lo | Output Drive |{VTTLO, 1 = 5.5V| TTL Outputs, Vo = 2.25V 


Current MOS Outputs, Vo = 2.25V 


Icpo Charge Pump |VEE = —4.2V to —5.7V | 


VoL Output Low VTTLO, 1 = 4.5V| TTL Outputs, lo. = 8MA 


100 pA 


| TRI-STATE® —10 
VTTLO, 1 = 5.5V TTL Supply 
VECLO, 1 = OV | VEE = -—5.7V, 10k ECL Supply Range 


VEE = —4.8V, 100k ECL Supply Range 
Note 1: TTL inputs—ENIN1, 2,3, RGP HORIZ, EXT REF, SO, $1, $2 

TTL outputs—ENOUT1, 2, SOUT, LCLKO, 1, 2 

MOS outputs—PHI1, PHI2 

ECL outputs—ENOUT3, ENOUTS, PCLK, PCLK, LCLK3, LCLK3 


4 
> 


loc 


| | 
© (an) wu _ 
3 
> 


AC Electrical Characteristics 


Symbol Parameter Conditions 
FMAX Max VCO Freq (Note 2) Self Osc 


ao) 


ae [eee | wa 
eonnue a ie ee 

ea 

ee oe ie eee 
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AC Electrical Characteristics (continued) 


Note 1: Rise, Fall times measured from 0.5V to VTTL — 2V with CL = 50 pF. 
Note 2: Fyyax is not production tested but is assured by characterization to include sufficient margin beyond processing extremes. 


Circuit Operation 


The heart of the DP8512 Video Clock Generator is a crystal 
oscillator which is used as a frequency reference to gener- 
ate several clock signals required in a video display system. 
Among the clocks generated are the two-phase clock for 
driving the RGP and BPU processors, ECL pixel and load 
clocks (PCLK and LCLK3) for high speed video shift register 
parallel load and shift operations, and TTL load clocks 
(LCLKO, LCLK1, and LCLK2) for moving DRAM and FIFO 


data to the video shift registers. The LCLK and PCLK out- 


puts are ail internally synchronized in order to simplify sys- 
tem timing. . 

The two-phase graphics processor clock (PHI1 and PHI2) 
operates at the frequency of the crystai oscillator. It is capa- 


_ ble of directly driving the raster graphics and BIT-BLT proc- — 


essors. The two-phase clock is closely aligned with the oth- | 


er clocks generated by the device to maximize system oper- 


ation. The PCLK and LCLK outputs are generated using a 
digital phase locked loop as shown in Figure 7. 


SOUT 
> 


PHASE 
COMP 


| TL/F/8758-10 
FIGURE 1. PLL Block Diagram 
The loop consists of the S and L counters, a phase compar- 
ator, and a voltage controlled oscillator (VCO) with the rela- 
tionship between these elements in the loop defined as: 


VCXO XL 


PCLK = S 
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PHI1 to PHI2 . 


where PCLK is the pixel clock frequency, L is the L Counter 
modulus, and S is the S Counter modulus. When the fre- 
quency of the VCO (PCLK) in the phase locked loop is sta- 
ble the inputs to the phase detector are in phase, thus the S 
Counter and L Counter outputs are identical in both phase 
and frequency. The crystal oscillator ensures that the phase 
and frequency of the S Counter output remains constant. 
Any drift, or change in frequency, of the VCO will be divided 
down and appear as a shift in phase at the L Counter out- 
put. The phase detector will sense this phase error and gen- 
erate a correction voltage for the VCO input which is propor- 
tional to the magnitude of the frequency error. This correc- 
tion voltage will change the VCO frequency to eliminate the 
error thus keeping the loop locked. The correction voltage 
adjusts the VCO’s frequency by changing the capacitance 
of the varactor in the LC oscillator tank circuit. The varac- 
tor’s capacitance is proportional to the amount of reverse 
bias applied across it. The VCO correction voltage is provid- 
ed from the OP AMP output which is a 3V typical operating 
range (VEE + 0.5V to VELCO —1.2V). This operating range 
provides typically a 10% VCO frequency tuning range. 
This operating range limits the device from being used at 
multiple pixel rates unless an external op amp is added to 
extend the tuning voltage range or tank circuit components 
are bandswitched in. . 


The presence of the S Counter in the loop enables the 
graphics processor to operate at full speed independent of 
PCLK frequency. The video shift register’s parallel data 
width determines the L Counter modulus. An 8-bit parallel 
shift register would use an L Counter modulus of 8 so that a 
parailel load pulse occurs once every 8 pixel clocks. The L 
Counter output is used to derive the four LCLK outputs. 
These 4 LCLKs differ slightly in format to allow for the vari- 
ous system configurations highlighted in the following sec- 
tion. — / 


Circuit Operation (Continued) 


The ECL LCLK3 output is used in conjunction with the PCLK 
output to load data into a high-speed video shift register. 
The PCLK provides the clock and LCLK3 provides the load 
signal for the shift register. With a typical video shift register 
operating at frequencies around 200 MHz timing can be- 
come extremely critical. For this reason a Timing Adjust pin 
(Tadj) is provided that will allow the user to obtain the opti- 
mum LCLK3-to-PCLK setup and hold timing relationship. 


The other three LCLK outputs (LCLKO, LCLK1, LCLK2) are 
TTL outputs. They can be used to contro! a selection of 
different DRAMs/video shift register configurations as 
shown in the system architecture section which follows. 


Also included on the chip is a secondary phase locked loop 
which can be used to synchronize the graphics system to an 
external signal such as the horizontal sync pulse from a 
television broadcast. A block level diagram of this mode of 
operation is shown in Figure 2. The crystal oscillator is con- 
figured to operate as a voltage controlled crystal oscillator 
(VCXO). Any change in frequency of the VCXO forces the 
outputs of the S Counter and the RGP’s horizontal counter 
to shift in phase relative to the external horizontal reference 
signal. The auxiliary phase comparator senses any phase 
differences between its two inputs and produces a correc- 
tion voltage back to the VCXO which is proportional to the 
amount of the frequency error. This correction voltage will 
change the VCXO frequency to eliminate the error thus 
keeping the loop locked. 


TL/F/8758-11 


FIGURE 2 


TABLE I. Partial Table of PCLK Frequencies 
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For most applications the crystal oscillator will be in the 
range of 20 MHz. With a typical 8-bit system this means that 
the PCLK output will be running at 20 MHz x 8 bits = 
160 MHz. However, if the system is 16 bits wide instead of 8 
bits the PCLK frequency would become 20 MHz x 16 bits 
= 320 MHz which is beyond the range of the VCO. There- 
fore the S Counter block must be added to divide the crystal 
oscillator frequency down to a more manageable frequency. 
Using the equation and the above example of a 20 MHz 
crystal oscillator with 16 bits of data, the S Counter is used 
as a divide-by-two counter to get a PCLK frequency of 
160 MHz. 


The S Counter can be programmed to divide by any integer 
up to 8 and the L Counter can be programmed for any word 
width from 4 bits to 64 bits in increments of 4. Table | shows 
some of the frequencies possible using various values for 
the S- and L-modulus. 


VARIOUS SYSTEM ARCHITECTURES 


Figure 3 demonstrates the DP8512 in a system using a vid- 
eo shift register and a DRAM. Another possible application 
uses a video DRAM, or VDRAM, in place of the DRAM, as 
shown in Figure 4. This system differs from Figure 3 in that 
the VDRAM contains an internal shift register which allows 
memory to be randomly accessed while data is being output 
to the video shift registers through the serial port. The out- 
put of the VDRAM differs from the standard DRAM in that 
data is not valid at the output of the internal shift register 
until an SCLK pulse is provided. The offset in the LCLK3 
waveform from LCLK1 is obtained by connecting ENOUT1 
to ENINS. 


The third type of system employs a video shift register (such 
as the DP8515) that contains an onboard FIFO as shown in 
Figure 5. This architecture simplifies timing requirements in 
a high-speed multiboard system. The propagation delay 
time from the DRAM to the video shift register through the 
back plane is no longer restricted to be less than the clock 
frequency of the system. The VDRAM can be writing into 
the FIFO asynchronous to the data being loaded into the 
shift register. As in the previous systems, one LCLK1 pulse 
transfers the data to the VDRAM output. LCLK2 then writes 
several words into the FIFO prior to LCLK3 reading out the 
first word. The connections required to obtain this timing are 
shown in the diagram. 


32-Bit Word (L = 32) 
PCLK Frequency 


na 
na 
na 
160.0 MHz 
128.0 
106.7 
91.4 
80.0 
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Circuit Operation (Continued) 


DP8512 


| DATA 


~— [LCLK3 PL. f 
| PCLK CK, | VSR 
| | TL/F/8758~3 


PCLK 


LCLK3 | | oe | 
| . TL/F/8758—4 
FIGURE 3. System Configuration and LCLK Waveforms using DRAM 


DATA 


LCLK3 PL 


PCLK CK, 4 VSR | 


ENOUT! - 
|__| vec 
ENINS’ , 
TL/F/8758-—5 
PCLK 
LCLK1 | | | 
LCLK3 | | | | 


. TL/F/8758-6 
FIGURE 4. System Configuration and LCLK Waveforms using VDRAM 


DATA 


ENOUT1 


LCLK2 WR 


FIFO 
VOCG FLCLK3 pL : 
PCLK CK, | VSR 


ENIN2 
ENOUT2 


ENINS 


TL/F/8758-7 


PCLK | a i] | | | | | | | | | | | 1 | LJ | [ | | | | | | | | 
LCLK1 | | | 


TL/F/8758-8 
LCLK1 | | | | | | | 
LCLK2 | | | | | 
LCLKS | L | | 
TL/F/8758-9 


FIGURE 5. System Configuration and LCLK Waveforms using a VSR Containing FIFO 
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AC Timing Waveforms 


cLS8dG 


TSETUP, THOLD 


LCLKO 50% WN 
TSU1, 2,3 TH1, 2,3 
ENIN',2,3 50% — 


PCLK “Ff 
TPCLK 
LCLK3, ENOUT3 50% 
TL/F/8758-13 


TLCLK1, 2,3 
TEO1, 2,3 


TL/F/8758-12 


LCLKO 50% -—— 


LCLK1, 2,3 50% 


ENOUT!,2,3 50% 
TSOUT 
Gee eeenrarrmctecnncorainmmremneiane 
SOUT 50%—- 
TL/F/8758-14 
SOUT 50% -—- X x 
TSO=PHI1 
PHI1 50% 10% 


—TNO 
PHI2 10% 


TL/F/8758-15 
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DP8512 Functional Waveforms 


CRYSTAL [ ] | [7 [1 [~ ~ 1 | = | 


PHI1 


PHI2 


TL/F/8758—16 
Note 1: SOUT waveform displayed with S Counter in divide-by-4 mode. 


Load Clock Timing Diagram 
ENIN‘ 7 ns 
LCLK1 | | | | | | J = | aoe 
ENOUTI | ——- | —— 71 


TL/F/8758-17 


LCLKO 1 1 1 1 TL 1 1 c 7 m MJ MJ [— 
<A ea or (ee 
LOLK2 mM mi in 1 1 | | | 

ENOUT2 ee ees: 


TL/F/8758-18 


LCLKO MJ MJ MJ M7 J ] i 7 | | | | | 
LOLK3 7] 7] 7] 1 mM 
aS 


TL/F/8758-19 
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Counter Modulus Tables 


S Counter Inputs 


L = TTL Logic Low Level 
H = TTL Logic High Level 


Typical System Diagram 


HORIZ 
OUT 


EXT HORIZ 
= EXT REF 
SYNCH 


CPO2 


XTL 
ENOUTS 


LOOP 
rer |___ 
VIDEO ENABLE 


TO DAC 
SERIAL | PL 
e 2 

VIDEO DATA 


RGP SOUT PHI 


LC 
LCLK3 PCLK LCLK2 


<< , 
OUT : 


L Counter Inputs 


ad 
Go 


r 
NO 


rm 
coh, 


rr 
© 


0 = VEE or open 
1 = VECLO, 1 


DP8500 
RGP 
IN 


DP8512 
VCG 


CK OW 


DP8515 


VSR VIDEO DATA 
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Input Schematics (note 1) 


_ VCXO Inputs 


TL/F/8758-21 


VEE 


Timing Adjust Input 
- VECLO 


Note 1: Refer to the typical ESD circuit for all schematics. 


_ TL/F/8758-23 
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ENIN1, 2,3 
RGP HORIZ 
EXT REF 


LO-L3 Inputs 
- VECLO 


= = = VECLO = 200 mV 


VEE 
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S Counter Inputs (TTL) 


TL/F/8758-24 


TL/F/8758-—26 


Output Schematics 
TTL Outputs 
VTTLt 
260 
ENOUTI, 2, LCLKO, 1, 2, SOUT 
GND1 
TL/F/8758-—27 
PHI1, PHI2 Outputs 


TL/F/8758-—29 


Op Amp Output 
VECLO 


OPAMP, 2 


VEE 
TL/F/8758-31 
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ECL Outputs 
VECL1 
PCLK, PCLK 
LCLK3, LCLK3 
ENOUT3, ENOUT3 
10 KO 
VEE 
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Charge Pump Output/Op Amp Input 


OP AMP INPUT 
DIFF. PAIR 


Typical ESD Circuit 


--- 
| 
| 


a 


4 
| 


| 
| 
| 
I 


CPO 
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Typical Supply Wiring 


Single Power Supply Operation 


VITL1 VECO 


it VECLI F 


+4.5V T0 
+5.5V 
REGULATOR 


DP8512 


fcnpo  GND1 VEE VEE 


TL/F/8758-—34 


Dual Power Supply Operation 


+4,5V T0 
+5.5V. 

REGULATOR euenae 

VITL1 = VECLO 


DP8512 
| GNOO  GNDt VEE 
—5.7V 10 


—4.2V — 
REGULATOR 
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Typical Applications 


Crystal Reference System 


C2 
0.1 uF 


eee ee 


MV209 
30 pF 


(200 MHz) 


ss . : — a 


&eane ean ae eae @ @ © @& 


TL/F/8758-32 


1-132 


Typical Applications (continued) 


External Reference System 


EXTERNAL 


CERAMIC 
RESONATOR 


XTLB 
OP AMP 2 


Loop Filter Caiculations 

Several constants need to be known in order to determine 
the loop filter components. They are the loop divide ratio N, 
the phase detector gain Kp, the VCO gain Ko, the loop 
bandwidth w>,. and phase margin ¢. 

The constant Kp is fixed at 80 wA/rad for the DP8513. N is 
simply the L counter modulus for the main loop. For the 
secondary loop, N is the S counter modulus times any exter- 
nal division between the SOUT pin and the RGP HORIZ pin. 
(i.e., if S = 1 and there is a +100 counter between SOUT 
and RGP HORIZ, N = 1 x 100 = 100.) A 60° phase mar- 
gin is recommended, however, the equations allow other 
values to be used if desired. 

The oscillator gain constant of Ky can be obtained from 
Table IIl or determined experimentally. This is done by driv- 
ing the 27k resistor which normally connects the varactor to 
the op amp output with an external power supply. Set the 
supply to Vee + 3V and note the PCLK frequency. Next set 
the supply to Vee + 2V and note the frequency again. The 
difference in these two frequencies (times 27r to convert to 
radians) is Ky. For optimum performance, the desired PCLK 
frequency should be somewhere between the two frequen- 
cies measured above. This may require adjustment of the 
coil. 

Before choosing a value of wo, one fact should be pointed 
out. The 27k resistor and the 500 pF coupling capacitor 
between the coil and the varactor form a low pass filter with 
a cutoff of about 12 kHz. Thus, the loop bandwidth must be 
chosen to be less than this value. We recommend 27 X 
100 Hz to 27 X 3 kHz for wo. 

Having found all these constants, the following equations 
are used to find the component values: 


1.08 Nw 3.46 Ky K 0.27 Ky K 
EO cl a eS Ns eS I 
Kp Ko N Wo? N Wo? 


C1 
1 pF 


Rl 
20002 


_I 


0.037 pH 


(200 MHz} 


L m= nn EE EP all 
TL/F/8758-36 


To use a phase margin of other than 60°, use the following: 


Gree +4 
= 2 Kp Ko (cosec b ) 
2K, K 
po 
C1 = t 
Noe ano 


C2 = Ni. 2 b — tan db 
sec 
N Wo 


Example: Design a system with the following characteristics: 

External horizontal sync of 100 kHz 

1560 pixels per line (2000 pixels including retrace) 

20 bit wide video data 

10 MHz processor rate 
Note that this system will sync to an external source so that 
both loops must be used. 
The PCLK frequency will be 100 kHz x 2000 pixels per line 
= 200 MHz. The components in Table III will be used. Note 
that Ko = 24 Mrad. Because it is a 20 bit wide system the L 
counter modulus must be 20. 
By choosing w, = 27 * 2800 Hz, the equations give Ri = 
2000, C1 = 1.0 wF and C2 = 0.08 pF (use C2 = 0.1 pF). 
In the secondary loop, a ceramic resonator is used in place 
of a crystal to allow more pullability. Care should be taken 
when selecting a resonator to ensure its Q value in low. Its 
Kg is found experimentally to be 84 krads/V. The SOUT 
frequency will be the same as LCLKO or 200 MHz + 20 = 
10 Mhz. Thus, there must be a +100 counter between 
SOUT and RGP HORIZ so N = 100 for the secondary loop. 
In choosing wo, it should be noted that w, for the secondary 


' loop should be smaller than w,. for the primary loop so that 
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the main loop will be able to track the secondary without 
losing lock. Picking w) = 2a X 750 Hz gives R1’ = 75k, 
C1’ = 0.01 wF and C2’ = 820 pF. 
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Loop Filter Calculations (Continued) 

lf a resonator is not available, a coil can be substituted. 
Since the tolerance of most coils is large, this will require 
that this coil be tuned to the center frequency of the internal 
oscillator. in this case, the varactor’s capacitance needs to 
be chosen as a small percentage of the tanks total net ca- 
pacitance in order to restrict its contribution to the variation 
in the reference frequency. 


BOARD LAYOUT CONSIDERATIONS 


The VCO components/circuitry need to be isolated from 
such signals as PHI1, PHI2, and SOUT. These lines can 
produce sufficient noise to cause the DP8512’s VCO or 
even an external VCO to attempt to injection lock to them. 


lf this problem is present there will be a low frequency oscil- 
lation (1-30 kHz) on top of the operational amplifier’s DC 
tuning voltage. This oscillation indicates that the VCO is try- 
ing to lock to the phase transitions of the noise. The noise 
disrupts the ability of the VCO to provide a linear voltage to 
frequency translation and the loop will tend to wander. 


Recommended VCO Components 


4 
Fyco = 
cae 2aiLCror 


Ctot = 


Another test that can be done to verify the existence of this 
problem is to add a large vaiue (1 MQ) variable resistor 
between the CPO pin and ground. This resistor will cause 
the loop’s reference and feedback phase transitions to shift 
relative to one another. Although these two signals’ phase 
relationship is being varied, they should continue to be run- 
ning, in lock, at the correct frequency. This should not result 
in any change in the operational amplifier’s DC voltage. If 
the VCO is injection locking, the operational amplifier’s DC 
voltage will be observed to vary as the resistor value is var- 
ied. Additionally it can be observed by comparing LCLK and 
SOUT simultaneously, the part will go from locking solidly 
with zero phase jitter to large phase steps. This will happen 
at several resistor settings (phase offsets). 

The DP8512/13/14 Video Clock Generator Evaluation 
Board application note demonstrates an external VCO lay- 
out which minimizes the sensitivity to injection locking and 
also provides other helpful hints related to board layout. 
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TABLE Ill. Recommended VCO Components 


Frequency | L | TOKO 
| (MHz) ywH Part # 


[eo | ease [ esnanseccoces | se | s0 | wvz0o | 16 
a8 : 

[400 | 012 | esoenwssccones | so | 30 | wvz00 | 21 
es 
2 


200 0.037 | E502HNS-1000029 | 10 
| 220 | 0.037 | E502HNS-1000029 


CA 
PF | pF | Motorola #_ 


Ko 
Mrad/V 


| 30 | mvzoo | 19 
1 

| 30 | Mvz0o | st 
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DP8513 Multi-Board Video Clock Generator 


General Description 


The DP8513 is a clock generator intended for use in medi- 
um- to high-performance CRT graphics systems. The device 
simplifies timing and minimizes phase skew between the 
various signals involved in the transfer of DRAM (or 
VDRAWM) data into a DAC for display on a CRT. The DP8513 
is used in conjunction with the DP8514 Crystal Clock Gener- 
ator to simplify synchronization problems in multiboard sys- 
tems. The device generates several synchronous clocks 
from a reference input using digital phase locked loop (PLL) 
techniques. These synchronous clocks include a graphics 
processor clock, a raster-scan pixel clock, and various gat- 
ed TTL and ECL clocks required to transfer data from 
VRAM to video shift registers. Circuitry is also provided 
which enables the user to phase lock his graphics system to 
an external video source. 


In a multiboard system the REFIN and REFCLK inputs en- 
able the motherboard and the slave boards to be synchro- 
nously driven from a single master clock source such as the 
DP8514. 


The graphics processor clocks (PHI1 and PHI2) are non- 
overlapping two-phase clocks with MOS-compatible outputs 
capable of driving 100 pF loads at up to a 20 MHz rate. 

The raster-scan pixel clock (PCLK) and the parallel load 
| clock (LCLK3) required for video shift register operation are 
generated from the REFIN and REFCLK inputs using two 


programmable counters in a phase locked loop configura- 
tion. Video word widths from 4 to 64 bits in increments of 4 
are accommodated by these counters. The ECL-compatible 
PCLK outputs are capable of operating at up to 225 MHz 
from either a positive or a negative supply voltage. 

Two gated TTL clock outputs (LCLK1 and LCLK2) are also 
provided to enable easy transfer of data from a VRAM serial 
shift register directly into a video shift register or indirectly 
through a FIFO. 


Features 

m Phase-locked-loop generates synchronized system 
clock, PCLK, and LCLK 

m MOS-compatible single-phase or non-overlapping two- 
phase system clock output 

@ 225 MHz ECL differential output pixel clock (PCLK) 

= Gated and non-gated load clock (LCLK) outputs ease 
VRAM-to-VSR synchronization 

m Accommodates video word widths from 4 to 64 bits in 
increments of 4 

@ Timing Adjust pin (TADJ) provides a fixed offset adjust- 
ment of PCLK to LCLK to ease system design 

m ECL circuitry can be referenced to positive or negative 
power supply 

m Enables horizontal synchronization from an external 
source 


Block Diag ramM (numbers in parentheses indicate pin numbers) 


EXT REF 
(19) 
@ 


PHIt 
(27) 


PHI2 
(26) 


2 PHASE 
1} CLK GEN 


REFCLK (21) © 
REFIN (22) © 


SO = S2 


3; | 
(25, 24,23) 07 


MAIN 
LOOP 
PH DET 
CHG PMP 


SOUT (32) © 


LCLKO (33) © 


LCLK TIMING 


ENIN1, 2,3 3 
O—— APT AND CONTROL 


(35, 40, 41) 


RGP HORIZ 
(18) 
@ 


P02 (16) 
OPAMP2 (17) 


CP01 (14) 
OPAMP (15) 


LO-L3 
(13, 12, 20, 11) 


vCO1 (8) 
vc02 (9) 


4 © 
ENOUT1, 2, 3,3 LCLK1, 2, 3,3 PCLK, PCLK 
(36, 38, 1, 44) (37, 39, 43, 42) (4, 3) 
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Connection Diagram 


OPAMP2 [1/) 


diupeee shee 
Oo 5 a IO ~w 2 Zz 
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S 
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zEE 


| | TL/F/9283-2 
Order Number DP8513V 
See NS recaoge Number V44A 


Pin Descriptions 


1, 44 


3,4 


6 


-ENOUT3, ENOUTS: Differential ECL video 
enable outputs synchronous to LCLKO and 
gated by ENIN3 (10k and 100k ECL compati- 
ble). Require conventional ECL 502 termina- 
tion. 

TADuJ: Timing Adjust pin allows the PCLK 
output transition to occur 2.5 ns earlier than 
LCLK3 and ENOUTS when taken to a high 
logic level. This allows the set-up and hold 
times of various Video Shift Registers to be 
accommodated. A logic high is VECLO, and a 
logic low is VEE. 

PCLK, PCLK: Differential ECL pixel clock 
outputs driven by the VCO in the main loop 
(10k and 100k ECL compatible). Require 
conventional ECL 502. termination. 

VECL1: ECL output buffer positive power 
supply. 

VECLO: ECL internal logic positive power 
supply. 


Note: VECLO and VECL1 can be operated from OV to +5.7V relative to 
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7, 34 
8,9 


10 


VEE: ECL negative power ‘supply. 
VCO1, VC02: External tank circuit connec- 
tions for the Pierce VCO. See typical applica- 


tions for typical wiring. 


VCS: No connection required. 


11, 12, 13, 20 L3, L1, LO, L2: A four bit word input used to 


14 


15 


16 


17 


18 


19 


21,22 


23, 24, 25 


select the L Counter modulus. Any modulus 
from 4 to 64 may be selected in increments 
of 4. LO is the least significant bit. High = 
VECLO, Low = VEE. These pins should be 
bypassed to VECLO with 0.01 pF caps. 
These traces should not be allowed to cross 
any signal traces. 


CPO1: Main loop charge pump bitput Used 


in conjunction with OPAMP1 to form the ex- 


ternal loop filter. 


OPAMP1: Op amp output of the main loop. 


This output is used to control the pixel clock 
frequency via the varactor diodes in the LC 
tank circuit. 


CPO2: Charge pump output of the secondary 
loop. Used in conjunction with OPAMP2 to 
form the external loop filter when synchroni- 


_ zation to an external video source is desired. 


OPAMP2: Op amp output of the secondary 
loop. This output is used to vary the crystal 
frequency (VCXO) in systems where it is de- 
sired to lock to an external video source. 


RGP HORIZ: TTL compatible secondary loop 
phase comparator input. This signal com- 
pletes the feedback path from the VCXO by 
way of the DP8500 Raster Graphics Proces- 
sor’s Horizontal Output. The falling edge trig- 
gers the secondary phase detector. 


EXT REF: TTL compatible external horizon- 
tal reference input. This is the optional hori- 
zontal input for systems where it is desired to 
lock to an external video source. The falling 
edge triggers the secondary phase detector. 
REFCLK, REFIN: Reference source inputs 
from which all output clocks are generated. 
See the typical applications for typical wiring. 
$2, $1, SO: TTL compatible three bit word 


that determines the S Counter modulus. SO is 


the least significant bit. 


Pin Descriptions (Continued) 


26, 27 


28 


29 
30 


31 


32 


33 


35 


PHI2, PHi1: MOS compatible two-phase 
non-overlapping clocks. The frequency of 
these clocks is that of the REFIN input. Use 
of these outputs to drive large capacitive 
loads requires the use of high frequency by- 
pass caps across VTTL1 and GND1 as close 
to the part as possible. 

VTTL1: TTL output buffer supply. Specified 
for 5V +10% operation. 

GND1: TTL output buffer supply return. 
GNDO: TTL internal logic power supply re- 
turn. 

VTTLO: TTL internal logic positive power 
supply. Specified for 5V +10% operation. 
SOUT: TTL compatible ungated output of the 
S Counter. It is also connected to one of the 
inputs of the main loop phase comparator. 
LCLKO: TTL compatible free-running Load 
Clock. This signal is also connected to an in- 
put of the main loop phase comparator. 
ENIN1: TTL compatible video enable input. A 
high on this input starts LCLK1 on the next 
positive transition of LCLKO. 
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36 


37 


38 


39 


40 


41 


42, 43 


ENOUT1: TTL compatible video enabie out- 
put synchronous to LCLKO and gated by 
ENIN1. 

LCLK1: TTL compatible load clock equiva- 
lent to LCLKO but gated by ENIN1. 
ENOUT2: TTL compatible video enable out- 
put synchronous to LCLKO and gated on the 
third positive transition of LCLKO following a 
valid ENIN2 input. 

LCLK2: TTL compatible load clock equiva- 
lent to LCLKO but gated by ENIN2. 

ENIN2: TTL compatible video enable input. A 
high on this input starts LCLK2 on the next 
positive transition of LCLKO. 

ENIN3: TTL compatible video enable input. A 
high on this input starts LCLK3 on the next 
positive transition of LCLKO. 

LCLK3 LCLK3: Differential ECL compatible 
load clock synchronous to LCLKO and gated 
by ENIN3 (10k and 100k ECL compatible). 
Require conventional ECL 502 termination. 
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Absolute Maximum Ratings 

If Military/Aerospace specified devices are required, 
please contact the National Semiconductor Sales 
Office/Distributors for availability and specifications. 


Storage Temperature 


TTL Signals 
inputs 
Outputs | 


—65°C to + 150°C 


7.0V 
7.0V 


ECL Signals 
Output Current 

Supply 
VEE to GND 

- VECL to GND 
VTTL to GND 


ESD susceptibility (see Note 5) 


Recommended Operating Conditions (notes 1,2, 3, 4 


| Symbol 


VTTLO, 1 
to GND 


VECLO, 1 
to VEE 


VEE to GND 
Vy 


Vit 


lOH 
lot 


FPCLK (Note 6) 
Fin MAX 
Tsut 

Tsu2 

Tsu3 

Tsu4 

Thi 

Tha 

Ths 

Th4 
Tambient 


Parameter 


TTL Power Supply 


ECL Power Supply 


High Level Input Voltage 
TTL Inputs 


Low Level Input Voltage 
TTL Inputs 


High Level Output Current 
TTL Outputs 


High Level Output Current 
MOS Compatible Outputs 


Low Level Output Current 
TTL Outputs 


Low Level Output Current 
MOS Compatible Outputs 


Pixel Clock Max Frequency 


REFCLK Frequency, REFIN = fREFCLK/2 


Setup Time ENIN1 to LCLKO 
Setup Time ENIN2 to LCLKO 
Setup Time ENIN3 to LCLKO 
Setup Time REFIN to REFCLK_“— 
Hold Time LCLKO to ENIN1 

Hold Time LCLKO to ENIN2 


B.S 
Nm 


| 
an 
N 


ah, 
and, 


—50.0 mA 


—7V to 0.5V 
—0.5V to 7V 
—0.5V to 7V 


1000V 


Units 


on 
N 
< 


< 


—0.4 


MHz 
MHz 


| 
P 


| HoldTimeLcLKotoeNINS | S| 9 | ts 
| _HoldTimeREFOLK-~torEFIN | 2 | | 
| OperatingTemp.Range | | | | 


Note 1: See Timing Waveforms for relevant signal edges (positive or negative) from which all setup and hold times measurements are made. 

Note 2: TTL inputs; ENIN1, 2, 3, RGP HORIZ, EXT REF, SO, $1, S2, REFIN, REFCLK 

Note 3: TTL outputs; SOUT, LCLKO, 1, 2, ENOUT1, 2. 
MOS outputs; PHI1, 2 

Note 4: Inputs LO, L1, L2, L3, and TADJ designed to be tied to VECL for high level or shorted to VEE (or left open) for low level. See input schematics. 

Note 5: Human body model; 120 pF thru 1.5 kQ. 


Note 6: Maximum frequency that PCLK can reliably be “locked”. The VCO range should be controlled to avoid exceeding 235 MHz with the maximum control 
voltage applied to the VCO. 
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| 
DC Electrical Characteristics (note 1) o 
Pa 


Oo 
ms 
< 


Mosoumue Hum t0oeA _ 

lo. = 20 mA 

=109 
aaa 


iF Max High Level | VTTLO, 1 = 5.5V, TTL Inputs, Vij, = 7V 
Input Current 

iH High Level Input] VTTLO, 1 = 5.5V, TTL Inputs, Vj, = 2.7V 
Current | 

Ne Low Level Input |}VTTLO, 1 = 5.5V, TTL Inputs, Vj, = 0.4V 
Current 


lo Output Drive VTTLO, 1 = 5.5V | TTL Outputs, Vo = 2.25V 
Current MOS Outputs, Vo = 2.25V | 


a 


—1490 | mV 
VEE+0.5| V 


Symbol| Parameter | === Conditions =| Min. Typ | Max [Units 
Vic VTTLO, 1 = 4.5V, lin = —18mA | ae ‘ 
Vou TTLOutputs,lon=—400pA | vTL-2 || 
fescwms fou Wek ___ etapa V 
| -23| | 
-1135 | | -80_| mv 
VEE = ~42V [OPAMP Output, lo = —1.251CPOSink ——|VECLO-1.2| | |S 
VoL VITLO, 1 = 4.5V Ee OR! 
pf os | 
208. | 
|| = 1490 | 
| VEE+ 05 


< 
a 
c 


100 pA 


IcpPo «06: [Charge Pump_—s | VEE = —4.2V to —5.7V 1.0 | mA 
Current VECLO, 1 = OV 10 a 
=10 HA 

loc | Supply Current 


=) : 
=| 
> 


VECLO,1 = OV {VEE = —5.7V, 10k ECL Supply 
VEE = —4.8V, 100k ECL Supply 


Note 1: TTL inputs; ENIN1, 2, 3 RGP HORIZ, EXT REF, SO, S1, S2, REFIN, REFCLK 
TTL Outputs; ENOUT1, 2, 3, SOUT 
MOS Outputs; PHI1, PHI2 
ECL Outputs; ENOUT3, ENOUTS, PCLK, PCLK, LCLK3, CCLK3 


185 


wk 
G 
on 
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Functional Waveforms 
| . DP8513 Functional Waveforms 


SOUT 
(NOTE 1) 


LCLKO a | 
Note 1: SOUT waveform displayed with S Counter in divide-by-4 mode. 


Load Clock Timing Diagram 


woo | LP LP LILI LILI LILI LILI LILI 
ENIN1 | | | 


LOLK1 TLELI LILI oo - 


ENOUTI | | | | 
co [LP LP LPF LIF LILI LULI LILI LILI] 


TL/F/9283-17 


TL/F/9283-18 


ENIN2 | | a. 


LCLK2 TLILE LILI] 
ENOUT2 | 


ENINS ee 
LCLK3 TLELILILI 
rr ( 
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AC Timing Waveforms 
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TSETUP, THOLD 
LCLKO to ENIN1, 2, 3 


LCLKO 50% 


TSU1, 2,3 THI, 2,3 
ENINI, 2,3 
TL/F/9283-13 
PCLK 50% 
TPCLK 
LCLK3, ENOUTS 50% 
TL/F/9283-14 
LCLKO 
TLCLK1, 2,5 
LCLK1,2,3 
TEO1, 2,3 
ENOUTI, 2,3 
TSOUT 
SOUT 50% aaa aaa 


TL/F/9283-15 


SOUT 


PHI1 


TL/F/9283-16 
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AC Electrical Characteristics 


symbol | Parameter, |= Conditions | Min | Typ | Max | Units 
FMAX | MaxvCOFregNote2) | 8 || Me 
FMAX | MaxREFOLKFreg | 8 Me 
PW | PulseWidthHigh PHI1,2_ | REFCLK=40MH2,REFIN=20MHz | | a1 | | ns 
Teor | tcixotoenours =| | He os 
Teor | Lcikotoenoura, | | | He os 
Teos_ | tcikotoenours, | 8 | 7 | | os 
TpoLK Waussenourswrax | Piaget 9 28 | 50 ns 

| TADJ=Lowivee) | -26 | 0 | +25 | ns 
Trou | tctkororcuxt = |B | 
Tuouxe | Lcikotoucuke | | ts 
Tuouxs | tcikotorcuks | Ht | 7 | ots 
Tew | LOLKotoPHII” | Loop Locked REFIN=2omHzs=6 | | -5 | | ns 
Tsour__| LCLKO~toSOUT_~ | LoopLockedREFIN=20MHzS=8 | | 0 | | ns 
Teo-pan | SOuTtoPHI” | Ht | 8 | ts 
TNO __| NonoverlapTimePHittopHi2 | G=sopF | | || 
TRTF | Rise,FalTimePHit,PHI2 | (Note) | | a | 


Note 1: Rise and Fall times measured from 0.5V to VITL1 — 2V with C_ = 50 pF 
Note 2: This is not production tested but is assured by characterization to include sufficient margin beyond processing extremes. 


Circuit Operation 

The DP8513 Video Clock Generator Slave generates sever- 
al clock signals required in a video display system from two 
reference frequency inputs: REFCLK and REFIN. These sig- 
nals are provided by the DP8514 Crystal Clock Generator. 
The 2X REFCLK input is used as a resynchronizing clock for 
the 1X REFIN input thus eliminating duty cycle distortion 
introduced by the backplane. Among the clocks generated 
are a two-phase clock for driving the RGP and BPU proces- 
sors, ECL pixel and load clocks (PCLK and LCLK3) for high 
speed video shift register parallel load and shift operations, 
and TTL load clocks (LCLKO, LCLK1, and LCLK2) for mov- 
ing DRAM and FIFO data to the video shift registers. The 
LCLK and PCLK outputs are all internally synchronized in 
order to simplify system timing. 

The two-phase graphics processor clock (PHI1 and PHI2) 
operates at the frequency of the REFIN input. It is capable 
of directly driving the raster graphics and BIT-BLT proces- 
sors. The two-phase clock is closely aligned with the other 


D 


REFIN [>] FLIP/ FLOP 


) 


COUNTER 


_ clocks generated by the device to maximize system opera- 


tion. The PCLK and LCLK outputs are generated using a 
digital phase locked loop as shown in Figure 7. 


The loop consists of the S and L counters, a phase compar- 
ator, and a voltage controlled oscillator (VCO) with the rela- 
tionship between these elements in the loop defined as: 


REFIN XL 
S) 


where PCLK is the pixel clock frequency, REFIN is the 
REFIN frequency, L is the L Counter modulus, and S is the 
S Counter modulus. When the frequency of the VCO (PCLK) 
in the phase locked loop is stable the inputs to the phase 
detector are in phase, thus the S Counter and L Counter 
outputs are identical in both phase and frequency. As long 
as the REFCLK input is constant, the phase and frequency 
of the S Counter output remains constant. Any drift, or 
change in frequency, of the VCO will be divided down and 
appear as a shift in phase at the L Counter output. The 


PCLK = 


aw SOUT 


L 
COUNTER | ee 


TL/F/9283-3 


FIGURE 1. PLL Block Diagram 
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Circuit Operation (Continued) 


phase detector will sense this phase error and generate a 
correction voltage for the VCO input which is proportional to 
the magnitude of the frequency error. This correction volt- 
age will change the VCO frequency to eliminate the error 
and keep the loop locked by changing the capacitance of 
the varactor in the LC oscillator tank circuit. The varactor’s 
Capacitance is proportional to the amount of reverse bias 
applied across it. The correction voltage is provided by the 
OPAMP output which has a 3V typical operating range (VEE 
+0.5V to VECLO —1.2V). This operating range provides a 
+ 10% (typical) VCO frequency range. This restricts the use 
of multiple pixel rates. However, an external OPAMP can be 
added to extend the tuning voltage range or tank circuit 
components can be bandswitched in. 


The presence of the S Counter in the loop enables the 
graphics processor to operate at full speed independent of 
PCLK frequency. The video shift register’s parallel data 
width determines the L Counter modulus. An 8-bit parallel 
shift register would use an L Counter modulus of 8 so that a 
parallel load pulse occurs once every 8 pixel clocks. The L 
Counter output is used to derive the four LCLK outputs. 
These 4 LCLKs differ slightly in format to allow for the vari- 
ous system configurations highlighted in the following sec- 
tion. 


The ECL LCLK3 output is used in conjunction with the PCLK 
output to load data into a high-speed video shift register. 
The PCLK provides the clock and LCLK3 provides the load 
signal for the shift register. With a typical video shift register 
operating at frequencies around 200 MHz timing can be- 
come extremely critical. For this reason a Timing Adjust pin 
(TADJ) is provided that will allow the user to obtain the opti- 
mum LCLK3-to-PCLK setup and hold timing relationship. 
See Pin Definitions for further info. 


The other three LCLK outputs (LCLKO, LCLK1, LCLK2) are 
TTL outputs. They can be used to control a selection of 
different DRAMs/video shift register configurations as 
shown in the system architecture section which follows. 


Also included on the chip is a secondary phase locked loop 
which can be used to synchronize the graphics system to an 
external signal such as the horizontal sync pulse from a 
television broadcast. A block level diagram of this mode of 
operation is shown in Figure 2. An external oscillator (such 
as the DP8514) is configured to operate as a voltage con- 
trolled crystal oscillator (VCXO). Any change in frequency of 
the VCXO forces the outputs of the S Counter and the 


RGP’s horizontal counter to shift in phase relative to the 
external horizontal reference signal. The auxiliary phase 
comparator senses any phase difference between its two 
inputs and produces a correction voltage for the VCXO 
which is proportional to the amount of the frequency error. 


This correction voltage will change the VCXO frequency to 
eliminate the error thus keeping the loop locked. 


LOOP 
FILTER 
FIGURE 2 


For most applications the REFIN input (and thus PHI1 and 
PHI2) will be in the range of 20 MHz. With a typical 8-bit 
system this means that the PCLK output will be running at 
20 MHz X 8 bits = 160 MHz. However if the system is 16 
bits wide instead of 8 bits the PCLK frequency would be- 
come 20 MHz x 16 bits = 320 MHz which is beyond the 
range of the VCO. Therefore the S Counter block must be 
added to divide the REFIN input frequency down to a more 
manageable frequency. Using the equation and the above 
example of a 20 MHz REFIN input with 16 bits of data, the S 
Counter is used as a divide-by-two counter to get a PCLK 
frequency of 160 MHz. 


The S Counter can be programmed to divide by any integer 
up to 8 and the L Counter can be programmed for any word 
width from 4 bits to 64 bits in increments of 4. Table | shows 
some of the frequencies possible using various values for 
the S- and L-modulus. 


VARIOUS SYSTEM ARCHITECTURES 


Figure 4 demonstrates the DP8513 in a system using a vid- 
eo shift register and a DRAM. Another possible application 
uses a video DRAM or VDRA\M, in place of the DRAM, as 
shown in Figure 5. This system differs from Figure 4 in that 
the VDRAM contains an internal shift register which allows 


EXT VCXO 
(DP8514) 


TL/F/9283—4 


TABLE I. Partial Table of PCLK Frequencies 


8-Bit Word (L= 8) 16-Bit Word (L= 16) 
M ae PCLK Frequency PCLK Frequency 
| REFIN = 10 MHz | REFIN = 20MHz | REFIN = 10MHz | REFIN = 20 MHz | REFIN = 10MHz | REFIN = 20 MHz 


32-Bit Word (L.= 32) 
PCLK Frequency 
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Circuit Operation (Continued) 


memory to be randomly accessed while data is being output 
to the video shift registers through the serial port. The out- 
put of the VDRAM differs from the standard DRAM in that 
data is not valid at the output of the internal shift register 
until an SCLK pulse is provided. The offset in the LCLK3 
waveform from LCLK1 is obtained by connecting ENOUT1 
to ENINS. © 


The third type of system employs a video shift register (such 
as the DP8515) that contains an onboard FIFO as shown in 
Figure 6. This architecture simplifies timing requirements in 
a high-speed multiboard system. The propagation delay 
time from the DRAM to the video shift register through the 
back plane is no longer restricted to be less than one LCLK 
period. The VDRAM can be writing into the FIFO asynchro- 
nous to the data being loaded into the shift register. As in 
the previous systems, one LCLK1 pulse transfers the data 
to the VDRAM output. LCLK2 then writes several words into 
the FIFO prior to the LCLK3 reading out the first word. The 
connections required to obtain this timing are shown in the 
diagram. 

To implement a multiboard system the DP8513 is used in 
conjunction with the DP8514 and a Video Shift Register with 
a FIFO (such as the DP8515/16). The various LCLK signals 
from the DP8513 control the VRAM shifting and FIFO read 
and write operations to minimize the problems associated 
with backplane delays. | 


Typically the main board will contain a DP8513 Video Clock 
Generator Slave as well as a DP8514 Crystal Clock Genera- 
tor, the DP8515/16 Video Shift Registers (one per plane), 
and a main graphics processor (such as the DP8500). See 
the multiboard diagram which follows. The other boards will 
contain one or more planes of memory and the associated 
BIT-BLT processors (such as the DP8510), and one 
DP8514 to generate the two-phase clock for the proces- 
sors. 

The 2X and 1X clocks (XOUT and DATA OUT) provided by 
the DP8514 on the main board are used by the DP8513 to 
generate non-overlapping clocks for the graphics proces- 


LCLK3 


PCLK 


LCLK3 


PL 


sor. These 2x and 1x signals are also sent across the back- 
plane to the memory boards where other DP8514’s gener- 
ate non-overlapping clocks for the BIT-BLT processors. The 
display of information is started by an ENABLE signal from 
the RGP to the ENIN1 of the DP8513. The remaining ENIN 
and ENOUT pins are connected as shown in Figure 6 along 
with the resulting LCLK waveforms. LCLK3 and PCLK are 
used to read data from the FIFO and shift it to the DAC’s. All 
of the VSR’s must be on the main board to keep the 
225 MHz PCLK signal off the backplane. LCLK1 is sent 
through the backplane to the SCLK inputs of all the VRAMS 
and causes data to be shifted out to the VSR’s FIFO’s. 
LCLK2 is sent through the backplane to a buffer and then 
through the backplane again to the WR inputs of the FIFO’s. 
These two backplane delays are used to cancel the back 
plane delays from the DP8513’s LCLK1 pin to the VRAM’s 
SCLK pin and from the VRAM’s output to the FIFO’s input. 


Following the path from the LCLK2 pin of the DP8513 to the 
WR pin of the VSR, we find two backplane delays and a 
buffer prop delay which is assumed to be negligible in the 
following discussion. Note from Figure 6 that the LCLK3 
occurs three LCLK periods after LCLK2. This means that 
two backplane delays can be as much as three LCLK peri- 
ods, or Tap < 3/2 TicLk for no loss of data. To increase 
this time further one or more D Flip Flops may be inserted 
between ENOUT2 and ENINS as shown in Figure 3. Each 
flip flop will increase the allowable backplane delay by ap- 
proximately 1/2 LCLK period. 


TL/F/9283-5 


FIGURE 3 


DATA 


VSR 
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FIGURE 4. System Configuration and LCLK Waveforms Using DRAM 
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ENOUT!1 PL 


ENINS 
TL/F/9283-8 


PCLK 


LCLK1 


LCLK3 


| - TL/F/9283-9 
FIGURE 5. System Configuration and LCLK Waveforms Using VDRAM 
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PCLK 


LCLK1 


LCLK1 


LCLK2 


LCLKS 


TL/F/9283-11 
FIGURE 6. System Configuration and LCLK Waveforms Using a VSR Containing FIFO 
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Multi-Board Diagram 


MAIN BOARD 


BACKPLANE 


|” ATDATA | VRAM ju DP8514 


@ 
@ 
‘ E 
eae AUXILIARY MEMORY BOARD 


ee 
ae 
MAIN MEMORY BOARD 
a 
B 
E 
aes 
a 
ee 
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TABLE li. Counter Modulus Tabies 


L Counter Inputs 


r 


Ss ss —_ as ah 


re 


oh = ah a_k 


@ 


L = TTL Logic Zero (GNDO, 1) 
H = TTL Logic One (VTTLO, 1) 


1 1 


O = VEE 
1 = VECLO, 1 


Typical System Diagram 


RGP HORZ SOUT PHI ENIN1 
EXT REF 


OPAMP2 


REFCLK, 
REFIN ENOUT2 


ENOUTS 
LCLK3 


L CK WR 


P 
DP8515 VSR 


VIDEO DATA 
TL/F/9283-21 
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input Schematics 
Typical ESD Circuit 
TL/F/ 9283-22 
VCO Inputs 


VECLO 


VEE 


TL/F/9283~24 


Timing Adjust input 


» VECLO 


TL/F/9283-26 
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LO-L3 Inputs 


4KO. VECLO 


~ ao VECLO 
=~200 mV 
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TTL Inputs 
S0-S2, REFCLK, REFIN 


VTTLO 


GND1 GNDO 


TL/F/9283-25 


TTL inputs 
(ENIN1, 2, 3, RGP HORIZ, EXT REF) 


VTTLO 


ENIN1, 2,3 
RGP HORZ 
EXT REF 


GND1 GNDO 
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oO 
a "I 
Output Schematics ~ 
Pn) 
TTL Outputs ECL Outputs 
(ENOUT1, 2, LCKLO, 1, 2, SOUT) VECLI 
VTTL1 sie 
260 PCLK, PCLK 
LCLK3, LCLK3 
ie ENOUT3, ENOUT3 
VEE 
TL/F/9283-29 
ENOUT1, 2 
LCLKO, 1, 2 
SOUT 
wey GND! 
TL/F/9283-28 
PHI1, PHI2 Charge Pump Output/OP Amp Input 
' VITLI 


GND1 


PHI1, PHI2 


8 
OPAMP INPUT 
GND1 | DIFF PAIR 


TL/F/9283-30 
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Op Amp Output 
VECLO 


OPAMP1, 2 


VEE 
TL/F/9283-32 
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Typical Power Connections 
Single Power Supply Operation 


VTTLO VTTLI 


VECLO VECLI| 


1+4.5V TO +5.5V1 
| REGULATOR 


DP8513 


| GNDO  GND1 VEE CVE 
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P+4.5V TO #5.5 V1 i — 
| REGULATOR | VTTLO VTTL1 VECLO VECLI1 


DP8513 
| GNDO GND1 VEE __VEE 


f-5.7V 10 ~4.2V 
REGULATOR 
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Typical Applications 


Crystal Referenced System 
PCLK = 200 MHz, L = 20,S = 1 


Cy) 0.1uF | 
Ic2) Rt 27K 
1.0 uF 2000 
: cP01 OPAMP1 500 pF 
20 MHz ——4 XC XOUT | REFCLK vc01 
= 0.037 
ec | Dpssi4 DP8513 ul 
30 pF 30 pF 


TL/F/9283-35 
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External Referenced System 
PCLK = 200 MHz, L = 20,S = 1 


Cty) 1.0 uF 
Ri 
Trop] yyy 2000 
04 27K 
RGP HORIZ HF | OPAMPI 
SOUT CPO1 500 pF 
EXT SYNC EXT REF vcot 
DP8513 ee 
20 MHz [7 REFCLK ui 
CERAMIC | = {|DP8514 a 
RESONATOR | REFIN vC02 
500 pF JOPAMP2 CPO2 VCS 
7Ko C2’ 
: ; 10.1 yF 10 pF 
50 pF 35 pF Cl == 820 pF 


MV209 
VARACTOR 0:01 HF 30 pF 
RI. 75 KO 
VEE 


Loop Filter Calculations 


Several constants need to be known in order to determine 
the loop filter components. They are the loop divide ratio N, 
the phase detector gain Kp, the VCO gain Ko, the loop 
bandwidth >, and phase margin ¢. 


The constant Kp is fixed at 80 wA/rad for the DP8513. N is 
simply the L counter modulus for the main loop. For the 
secondary loop, N is the S counter modulus times any exter- 
nal division between the SOUT pin and the RGP HORIZ pin. 
(i.e., if S = 1 and there is a +100 counter between SOUT 
and RGP HORIZ, N = 1 xX 100 = 100.) A 60° phase mar- 
gin is recommended, however, the equations allow other 
values to be used if desired. 


The oscillator gain constant of Ky can be obtained from 
Table Ill or determined experimentally. This is done by driv- 
ing the 27k resistor which normally connects the varactor to 
the op amp output with an external power supply. Set the 
supply to Veg + 3V and note the PCLK frequency. Next set 
the supply to Veg + 2V and note the frequency again. The 
difference in these two frequencies (times 27r to convert to 
radians) is Kop. For optimum performance, the desired PCLK 
frequency should be somewhere between the two frequen- 
cies measured above. This may require adjustment of the 
coil. 


Before choosing a value of wo, one fact should be pointed 
out. The 27k resistor and the 500 pF coupling capacitor 
between the coil and the varactor form a low pass filter with 
a cutoff of about 12 kHz. Thus, the loop bandwidth must be 
chosen to be less than this value. We recommend 277 X 
100 Hz to 27 X 3 kHz for wo. 


Having found all these constants, the following equations 
are used to find the component values: 


_ 1.08 N wo _ 3.46 Kp Ko 


_ 0.27 Kp Ko 


R1 
N Wo? 
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To use a phase margin of other than 60°, use the following: 


Ay = N ®&o 


(cosec @ + 1) 


2 Ky K 
p Ko 
Ci = ———~ tan 
N wo? 


sec @ — tan 
Naga eC ) 
Example: Design a system with the following characteristics: 

External horizontal sync of 100 kHz 

1560 pixels per line (2000 pixels including retrace) 

20 bit wide video data 

10 MHz processor rate 
Note that this system will sync to an external source so that 
both loops must be used. 
The PCLK frequency wil be 100 kHz x 2000 pixels per line 
= 200 MHz. The components in Table III will be used. Note 
that Ky = 24 Mrad. Because it is a 20 bit wide system the L 
counter modulus must be 20. 
By choosing w> = 27 ® 2800 Hz, the equations give R1 = 
2000, C1 = 1.0 uF and C2 = 0.08 pF (use C2 = 0.1 pF). 
In the secondary loop, a ceramic resonator is used in place 


C2 = 


of a crystal to allow more pullability. Its Kg is found experi- 


mentally to be 84 krads/V. The SOUT frequency will be the 
same as LCLKO or 200 MHz + 20 = 10 MHz. Thus, there 
must be a +100 counter between SOUT and RGP HORIZ 
so N = 100 for the secondary loop. 


In choosing ao, it should be noted that wo for the secondary 
loop should be smaller than wo, for the primary loop so that 
the main loop will be able to track the secondary without 
loosing lock. Picking wa) = 27 X 750 Hz gives R1’ = 75k, 
C1’ = 0.01 pF and C2’ = 820 pF. 
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Recommended VCO Components 


27K 


VARACTOR 


TL/F/9283-37 
1 


| 
FvCo Fafror TOT 


CA . CyARACTOR 


TABLE Ill. Recommended VCO Components 


(MHz) pF aaa Mrad/voit 
Part # 


[60 | ose | esoars-sooo02e | se [a0 | _wvaoo | 16 
| 80| 0.17 | Es02HNS-4ooo024 | 30 | 30 | Mvaoa | 19 
| 100 | 0.12 | eso2HNs-soo00zs | so | 30 | mvzoo | oat 
| 120 | 0.07 | esoans-20000z2 | 30 | so | mveoo | at 
| 140 | 0.07 | esozHNs-2000022 | 22 | so | mvzoa | a7 
| 160 | o.07 | Es0zHNS-2000022 | 15 | 15 | Mvze05 | a7 
| 180 | 0.07 | Esc2HNS-2000022 | 10 | 15 | _Mvza0s | 26 
| 200 | 0.037 | Es02HNS-ro00020 | 10 | 30 | mvzos_ | 24 
| 220 | 0.037 | Es0zHNS-1000020| 10 | 15 | Mvze05 | 34 
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DP8514 Crystal Clock Generator 


General Description 


The DP8514 Crystal Clock Generator consists of a crystal 
or LC tank oscillator and a synchronizer/2-phase nonover- 
lapping MOS clock driver. It is designed to interface directly 
with the DP8513 Video Clock Generator in multiboard 
graphics applications. However its features and flexible de- 
sign allow it to be used in numerous other applications as 
well. 


There are two outputs from the Pierce crystal oscillator. One 
is the same frequency as the crystal and has an approxi- 
mate 50% duty cycle while the other is half the crystal fre- 
quency with a 50% duty cycle. Both of these outputs are 
TTL-compatible. The oscillator may also be used as an LC 
oscillator, if desired. 


The other section of the die contains a resynchronizer with 
additional clock follow logic, and a 2-phase nonoverlapping 
MOS-compatible clock driver. Both sections of the die may 
be used independently of each other. The synchronizer is a 
D register which has a clock input, REFCLK, a data input, 
REFIN, and a mode control input, SEL, which allows the 


REFCLK input to contro! the synchronizer’s output. This fea- 
ture allows either the clock or the resynchronized clock + 2 
to be fed to the MOS clock driver. A TTL output (SYNC 
OUT) in phase with the PHI1 output is also provided. 


Features 

m Pierce oscillator may be used with crystal, ceramic res- 
onator, or LC tank circuit. External varactor allows VCO 
or VCXO mode. 

m TTL-compatible oscillator and oscillator +2 outputs. 

m Two-phase nonoverlapping MOS-compatible clock out- 
puts drive 50 pF loads at 20 MHz. 

Synchronizer/driver eases synchronization of PHI1 and 
PHI2 clocks on multiple boards. 

m TTL-compatible SYNC OUT in phase with PHI1. 

@ Available in standard 16 pin DIP, 16 pin SO, and 20 pin 
PCC packages. 


Block Diagram pin numbers are for 20 pin PCC (16 pin DIP) package 


VCCC GNDC 
17(13) 2(2) 
e. 6 


TP VCXO. 


ORT GD) a> 
XB 20(16) > 


VCCB 16(13) © 


GNDB 3(2) © 


SEL 10(8) > 


REFCLK 19(15) [> 


REFIN 18(14) (> 


VCCA GNDA 
15(13) 14(12) 
e @ 


> 12(10)X OUT 


P™S> 13(11) DATA OUT 


> 11(9) SYNC OUT 


© 6(5)VCC I 
> 4(3)PHI1 
© 5(4)GND1 
© 7(5)VCC2 
> 9(7)PHI2 
© 8(6)GND2 
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Connection Diagrams 


Pin Descriptions (parenthesis indicate 16 pin DIP) 
XC, XB: External connections for the Pierce os- 


1,20, 


(1, 16) 


2,3 
(2) 


PCC Package 


GNDB 
GNDC 


.S) 
=< 


[ae] 
=< 


PHI2 
SEL 
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Order Number DP8514 
See NS Package Number M16A, N16A or V20A 


cillator. XC requires a 5.6 k© resistor to VCCC. 


GNDC, GNDB (GNDB): Power supply return for 
the crystal oscillator, and power supply return 
for all circuitry except the VCXO and output buff- 
ers. The 16 pin version has a single power sup- 
ply return for all circuitry except the output buff- 
ers. 


PHI1, PHI2: MOS-compatible two-phase non- 
overlapping clocks. The frequency of these sig- 
nals is that of the REFCLK input when SEL is 
high and that of the REFIN input when SEL is 
low. 


GND1, GND2: PHI1 and PHI2 output buffer pow- 
er supply return. 


vec1, VCC2 (VCC1): PHI1 and PHI2 positive 
power supplies specified for operation at 5V 
+10%. 

SEL: TTL-compatible MUX control input selects 
either the REFCLK input or the REFIN input (re- 
synchronized to REFCLK by a Flip Flop) to be 
passed to the PHI generator circuitry. A low se- 
lects the REFIN input frequency and a high se- 
lects the REFCLK input frequency. 


1-154 


11 
(9) 

12 
(10) 
13 
(11) 
14 (12) 
15, 16, 17 
(13) 


18 
(14) 


19 
(15) 


Dual-in-Line Package 


1 
2 
3 
4 
5 
6 
7 
8 
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SYNC OUT: TTL-compatible output equivalent 
to PHI1. 


X OUT: TTL-compatible output of the crystal os- 
cillator. 


DATA OUT: TTL-compatible output whose fre- 
quency is that of the crystal divided by two. 


GNDA: TTL output buffer power supply return. 


VCCA, VCCB, VCCC (VCCA): TTL output buff- 
er, internal circuitry, and crystal oscillator posi- 
tive power supply, respectively. The 16 pin ver- 
sion has a single positive power supply for all 
circuitry except the PHI1 and PHI2 buffers. 


REFIN: TTL-compatible input typically used to 
generate PHI1 and PHI2. Equivalent to the RE- 
FIN input on the DP8513. 

REFCLK: TTL-compatible input typically used to 
synchronize multiple DP8514’s. This is the CK 
input of a positive edge triggered D Flip Flop. 
Equivalent to the REFCLK input on the DP8513. 


Absolute Maximum Ratings 


lf Military/Aerospace specified devices are required, Storage Temperature -65°C to + 150°C 
please contact the National Semiconductor Sales Voc 7.0V 
Office/Distributors for availability and specifications. Inputs 7.0V 
Outputs 7.0V 

ESD Sensitivity (Note 2) 1000V 


Recommended Operating Conditions 


ie ee 

lon Low Level Output Current a: ee re ee 
wos [| 

ae 


Units 


MHz 
MHz 


: 


DC Electrical Characteristics 


Symbol | __—Parameter_—|_——Condiitions, =| Min’ | Nom_| 
VoH High Level Output Voltage Tee 

TTL Outputs lon = —0.4mA CG 

High Level Output Voltage | lon= —04mA | Voc 23 | | 

MOS Outputs lon = —100 wa | Voo-04 | 
VoL Low Level Output Voitage 

TTL Outputs lol = 8mA 


Low Level Output Voltage lo. = 20 mA iS ies 
I Max High Level Input Current VTTLO, 1 = 5.5V, Vin = 7V a Tee 


5.5 
—0.4 
40 
40 


3 
> 


< 


=e TE 
> | > 


HH High Level Input Current VTTLO, 1 = 5.5V, Vin = 2.7V 
He Low Level Input Current VTTLO, 1 = 5.5V, Vin = 0.4V 


lo Output Drive Current 
TTL Outputs Vo = 2.25V 
Output Drive Current | 495 
MOS Outputs Vo = 2.25V 


loc Supply Current Voc = 5.5V 20 | 


MOS = PHI1,2 outputs TTL = X OUT, DATA OUT, SYNC OUT 


= 
> 


mA 


mA 


AC Electrical Characteristics 


symbol | __Parameter, =| = Condiitions, = 
Fmax | Max Oscillator Frequency eee 
(Note 1) | Max Resynchronizer Clock Frequency | 40% < Fin Duty Cycle < 60% 


Units 


MHz 


Dc X OUT Duty Cycle are 
PW Pulse Width High; PHI1, 2 REFCLK = 40 MHz, REFIN = 20 MHz 
TNo Non-overlap Time PHI1 to PHI2 CL = 50 pF 


% 


ns 


NO 
© 


ns 


ns 


om me) 
< 
S 
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AC peo Characteristics (Continued) 


Symbol _ 
Tr. Te 
Tsyne a ee 
Ten |_REFCLKtOPHI | 

iC ee ee RE 
TSE 


| Note 1: This parameter is not production tested but is assured by characterization to include sufficient margin beyond processing extremes. 
Note 2: Human body model; 120 pF thru 1.5 k. 


AC Timing Waveforms 


DP8514 


—_ 
Oo 


X OUT 


DATA OUT 
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REFIN “ ‘ 
Igy Ty 


REFCLK 1.5¥ 


SEL 1.5¥ 


PHI1 
Two Tho 
PHI2 .o. 


Tsync Tsync 
SYNCOUT 1.5V | | 


Units 


ns 


ns 


ns 


ns 
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Functional Waveforms 


Resynchronizer Mode 


REFIN 


REFCLK 


SEL 


SYNC OUT 
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Functional Waveforms (Continued) 
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Clock Follow Mode 


OOK 


VY 


REFIN /\ 
REFCLK 
SEL 
SYNC OUT 
TL/F/9284-11 
Wiring Diagrams 
y 5.6 kQ 
a“ mn ~ CONTROL VOLTAGE (>0V) 
DP8514 
a i : 
= Ss TL/F/9284-7 
C1, C2 dependent on crystal ~ 30 pF 
FIGURE 1. Crystal Oscillator | —— 
. TL/F/9284-8 
1 C1 xX VAR. 
Fx ouT = OmGy where Cy = C17 VAR given 0.1 pF > Cry 
FIGURE 2. VCO 
Veec 


CONTROL VOLTAGE (>0V) 
50kn. 
TTL COMPATIBLE 
0.1 #F =) CERAMIC Pt 
vis deat bea ae }wos COMPATIBLE 
Ct +r 


— TL/F/9284-9 


Fsync out = FcERAMIC RESONATOR * 2 
FIGURE 3. VCXO with MOS Compatible Outputs 


Typical Applications 


REFIN 
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REFCLK 


REFIN 
XOUT DP8513 
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REFCLK 


, TL/F/9284~13 
FIGURE 4. Single Board System 
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Typical Applications (continued) 
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FIGURE 5. Multiboard System 


1-158 


OO 
<x 
Bxe REFCLK PHI1,2 | 
0,4 5 
; OO 
ren 
mah 6) 


TL/F/9284-—14 


National © 
Semiconductor 


DP8515/DP8515-350/DP8516/DP8516-350 


Video Shift Register (VSR) 


General Description 


The DP8515/DP8515-350/DP8516/DP8516-350 Video 
Shift Register (VSR) provides the functions of a high speed 
sixteen bit shift register and parallel data input latches/flip- 
flops required in high performance raster scan video sys- 
tems. Also on the VSR are four words of FIFO which by 
means of the mode control input pins MO and M1 may be 
placed in front of the shift register if the user so desires. 


The VSR has three operating modes; inputs configured as 
transparent latches, inputs configured as flip-flops (one 
word FIFO mode), and four word FIFO mode. As mentioned 
above, the mode control input pins, MO and M1 select in 
which mode the part is operating. In all three modes, the 
WR input allows data into the part, the PARALLEL LOAD 
input loads data into the shift register, and the PIXEL 
CLOCK input shifts data out of the shift register. 


In the four word FIFO mode, four write operations may oc- 
cur before a shift register load operation is required in order 
to avoid writing over previously written data. The four words 
of FIFO significantly ease the timing constraints which are 
present when working with high speed multiple board sys- 
tems. 


The VSR has a HOLD input which, when activated, inhibits 
the shift function of the shift register. Two other inputs, 
OUTPUT CONTROL and OUTPUT LEVEL CONTROL, hold 
the last bit of the shift register at a level chosen by the user 
while allowing the internal bits of the shift register to contin- 
ue shifting. Another control input, ENABLE, causes the par- 
allel loading of the shift register to be inhibited when deacti- 
vated. 


The inputs to the VSR, except those associated with the 
shift register, are TTL compatible. The shift register in- 


| Connection Diagram 


PIXEL CLK 


ré) PIXEL CLK 


Veg 
Vee | 
HOLD Veet 
HOLD | EG AC 
SI 

SI 

TTL GND | 
ENABLE 
Mo 
D15 
D14 | 
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puts and control signals are ECL compatible as are the out- 
puts. Furthermore, two versions of the chip are available; 
the DP8515/DP8515-350 has ECL outputs which are 10K 
compatible and the DP8516/DP8516-350 has ECL outputs 
which are 100K compatible. All the ECL inputs and outputs 
are differential, however, a Vag reference output is provided 


for the user who wishes to use only single ended signals. | 


The VSR implements all the TTL to ECL conversions and 
gives the customer the choice of using positive or negative 
supplies for the ECL circuitry. When using positive supplies, 
the ECL and TTL may be operated off of the same, single, 
+5V supply. 


Features 

m@ TTL compatible parallel data inputs 

mw Data inputs may be used as transparent latches or flip- 
flops 

mw Four words of FIFO available—essential for high speed 
multiple board systems 

m Accepts input data at rates up to 20 MHz 
(30 MHz for —350) 

m Tap at eighth bit, allows use as 2 8-bit shift registers 

m ECL inputs and outputs may be differential or single 
ended 

mw TTL to ECL conversion performed on chip 

w@ Shift register clock rate of 225 MHz 
(350 MHz for —350) 

mw Can use positive or negative ECL supplies 

m@ Entire chip can operate off of single +5V supply 

m@ Total chip Icc less than 200 mA 

m Packaged in a 44-pin PLCC 

mw A member of National’s Advanced Graphics Chip Set 


Order Number DP8515V, 
DP8515-350V, DP8516V, 
or DP8516-350V 
See NS Package Number V44A 
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DP8515/DP8515-350/DP8516/DP85 16-350 


Block Diagram 


> WR INPUT DATA INPUTS 
INPUT LATCHES/FIFO WORD 1 
FIFO WORDS 2 TO 4 
| LATCH/FIFO SELECT aren 
| © OLC 
Vee 2 , Ly ply ENABLE 
| 1 BIT a + BIT : 
6 
ee, ECL SHIFT REGISTER 1 'g 4 19 PL/PL 
PIXEL CLK m , L 
: : | | 0¢/0C 
aA Re hs 
@ om 
s0/S0 
HOLD/HOLD 38/85 


Pin Descriptions 


36, 37 Veo1—This is the supply for the collectors. of the ECL 
emitter follower outputs. Both the 10K and 100K options are 
specified for a supply from 4.2V to 5.5V, assuming a positive 
supply is used. This allows use of the standard 5V +10% 
supply, 5.2V +5% 10K supply, or 4.5V +0.3V 100K supply. 
For a negative supply, these pins are at ground potential. 


42 Voco—This is one supply for the most positive rail of the 
ECL circuitry. It is a separate supply for the output buffers 
used to reduce noise coupling. The ranges are the same as 
those specified for Voci. 


43 Veco—This is the supply for the most positive rail of the 
ECL internal circuitry. The ranges are the same as those 
specified for Voc. 


32 TTL Voec—This is the positive supply for the TTL circuit- 
ry. The supply is specified at +5.0V +10%. 


13 TTL GROUND 


7,8 Vee—This is the most negative rail for the ECL circuitry. 

Using a positive supply, this pin is at ground potential. For a 

negative supply, both the 10K and 100K options are speci- 

fied from —4.2V to —5.5V. This allows use of the —5.2V 
+5% 10K supply or the —4.5V +0.3V 100K supply. 


44 Vag—This'is the bias reference for the ECL inputs. Ail of 
the ECL inputs are differential; however, if single ended use 
is desired, the unused input may be tied to this Veep pin. This 
pin is nominally set at Vcco — 1.3V at room temperature for 
10K ECL and Voco — 1.3V over temperature for 100K ECL. 
See the electrical characteristics table for the exact en: 
cations for Vep. 


16-31 DATA INPUTS—These are TTL compatible inputs 
designed to meet the ALS specifications (as are all TTL 
inputs on this circuit). The data present at these inputs will 


TL/F/8684~2 


be converted to serial data by the ECL shift register. Data 
on pin 31 (DO) will be the first bit shifted out of the shift 


_ register. The data on pin 16 (D15) will be the last bit shifted 
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out of the shift register. Data at these inputs must meet the 
setup time requirements for the WR input. 


15,34 MODE CONTROL INPUTS—M0, M1—These two 
TTL-compatible inputs control whether the data inputs are 
transparent latches or edge triggered flip flops. They also 
control whether or not the four word FIFO is placed in front 
of the shift register. With MO set high the FIFO is not pres- 
ent and the data inputs are transparent latches. When MO is 
high the level of the M1 input does not matter. With MO low 
and M1 low the FIFO is again not present and the inputs are 
edge triggered flip flops. Actually this is equivalent to there 
being one word of FIFO present. A low level on MO and a 
high level on M1 results in the four word FIFO being placed 
in front of the shift register. Although a power up reset is 
present for the FIFO address counters, if the user wishes to 
reset the FIFO address to word zero, a low pulse on M1 
while the PARALLEL LOAD input is in a high state will ac- 
complish this. 


33 WR INPUT—This is a TTL compatible input. When data 
inputs are configured as flip flops, data on the bus is latched 
into the input flip flops on the positive edge of the WR input. 
If the inputs are configured as transparent latches, data is 
passed on to the shift register inputs so long as the WR 
input is low. If the four word FIFO is being used (MO0=0 and 
M1 = 1), data on the bus will be latched into the first word on 
the positive edge of the WR input. | 


Pin Descriptions (Continued) 


6,5 PIXEL CLOCK, PIXEL CLOCK—These are the differ- 
ential clock inputs for the ECL shift register. If single ended 
use is desired, one input should be connected to the Veg 
pin. The first positive PIXEL CLOCK transition after a posi- 
tive transition on the PARALLEL LOAD input loads into the 
ECL shift register the sixteen bit word present at or already 
latched into the inputs by the WR input. Subsequent positive 
PIXEL CLOCK transitions will shift the remaining 15 bits out 
of the shift register. 


4,3 PARALLEL LOAD, PARALLEL LOAD—These are the 
differential load inputs for the ECL shift register. If single 
ended use is desired, one input should be connected to the 
Vee pin. When using the PARALLEL LOAD function, the 
setup time between PARALLEL LOAD and the positive go- 
ing edge of PIXEL CLOCK must be met. To load a sixteen 
bit word from the input (latched or just present at the input 
depending on mode of operation) to the ECL shift register, a 
positive edge on the PARALLEL LOAD input is required and 
the first positive going edge of the PIXEL CLOCK following 
this positive edge will load the data into the shift register. 


12,11 SERIAL INPUT, SERIAL INPUT—These are the dif- 
ferential serial data inputs to the ECL shift register. If single 
ended use is desired, one input should be connected to the 
Veep pin. These inputs may be used for expansion to a wider 
bus system where they are connected to the SERIAL OUT- 
PUT, SERIAL OUTPUT of the previous shift register. The 
information on these input pins is shifted into the shift regis- 
ter on the positive edge of the PIXEL CLOCK. 


39,38 SERIAL OUTPUT, SERIAL OUTPUT—These are the 
differential data outputs of the ECL shift register. They are 
ECL compatible and are capable of driving 502 loads. Ter- 
mination resistors are required when they drive the SERIAL 
INPUT pins of the next shift register in an expanded bus 
system. The first bit shifted out is DO. 


40,41 $8, S8 OUTPUTS—These are the differential data 
outputs of the ECL shift register for D8. If an eight bit wide 
word is used instead of a sixteen bit wide word, this output is 
the first bit of the second word. By providing this output, the 
number of Video Shift Registers required for eight bit wide 
systems is cut in half. 
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10,9 HOLD, HOLD—These are the differential ECL inputs 
used to inhibit the shifting of the ECL shift register. A high 
level on the HOLD input will disable the shifting of the ECL 
shift register. If single ended use is desired, one input 
should be connected to the Veg pin. When using the HOLD 
function, the setup time between HOLD and the positive 
going edge of the PIXEL CLOCK must be met. When HOLD 
is released, the output data will not change until the follow- 
ing positive edge of the PIXEL CLOCK. The HOLD function 
can be used as a simple way of doing a “zoom” operation in 
graphics systems. 


2,1 OUTPUT CONTROL, OUTPUT CONTROL—These are 
the differential ECL inputs which allow the user to manipu- 
late the shift register data if he so chooses. If single ended 
use is desired, one input should be connected to the Vep 
pin. With the OUTPUT CONTROL input high, the last bits of 
the shift register, that is the SERIAL OUTPUT SO and the S8 
bit are held at a level determined by the OUTPUT LEVEL 
CONTROL input while all the other bits are shifted by the 
PIXEL CLOCK. As with the HOLD input, the setup time be- 
tween OUTPUT CONTROL and the positive going edge of 
the PIXEL CLOCK must be met. This feature provides for a 
simple way of implementing a scrolling function. Since all 
the internal bits of the shift register are shifting, the user can 
effectively control the shift of the data on the screen. 


35 OUTPUT LEVEL CONTROL—This TTL compatible input - 


selects the level to which the output bit of the shift register 
will be set when the OUTPUT CONTROL inputs are activat- 
ed. A high level on this input sets the output to a one while a 
low level sets the output to a zero. 


14 ENABLE—This TTL compatible input, when taken to a | 


high level, inhibits the PARALLEL LOAD operation of the 
ECL shift register. So long as the ENABLE input remains 
low, every positive edge of the PARALLEL LOAD input fol- 
lowed by a positive edge of the PIXEL CLOCK results in a 
load operation on the shift register. When the ENABLE input 
is taken high the next PARALLEL LOAD command is ig- 
nored and all subsequent PARALLEL LOAD commands are 
ignored until the ENABLE input is taken low. Once taken 
low, the first PARALLEL LOAD positive edge and PIXEL 
CLOCK positive edge pair will result in a load operation. 
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Absolute Maximum Ratings 


If Military/Aerospace specified devices are required, ECL Signals 
please contact. the National Semiconductor Sales Inputs (Using Negative Supply) Veg to + 0.5V 
Office/Distributors for availability and specifications. Inputs (Using Positive Supply) GND to Vcc +0.5V 
Storage Temperature : —65°C to + 150°C Output Current (DC Output High) —50 mA 
TTL Signals Supply —7V to +0.5V 
outs: 7 7 (Vee to GND Using Negative Supply) 
puts 
Supply ae 7" Supply _ —0.5V to +7V 
ef | (Vcc to GND Using Positive Supply) 
ESD Susceptibility (Note 4) 1500V 


Operating Conditions (Notes 1,3 and 5) Ta = 0°C to +70°C 


iti Max 
Symbol Parameter Conditions | Min Typ | Max 
pPe616/16] —360|  [DP8515/16] — 960] 


Voo_|TSweySSC~—~S~SSSC‘“‘“(RS «AOC a CO] 
{pixel clk.) 


a oe 
een ae 
ee ae 
ane aE 
fwitey [WRRate 
ae ee 
ae Sere 
a 
oma! 
aa 
eee 
ae 


4.2 
| 25 
| 30(25)_| 


Units 


i 

icons) [PARALLELLOAD Rate FIFOMods ——SSidCSC*d [14 [es (00) wii 
ia [20 [90 
= ris | fs 
te 30 (05) feo] | | | ms 
“7 222) [rap | | | ns 
a” [a2 (62) [asenl | | | ns 
we Width of PARALLELLOAD input il Transparentwods || 15 |e [|| | 
tw? [Width of PARALLEL LOAD input LO Transparent Mode | so(ee) feof [|_| ns 
isu [Setup Time HOLDtoPMELCLK |i t ft | fd 
tsye [Setup Time OUTPUT CONTROL toPELGLK | —~+| 30 [26 [| | ns 
tsva [Setup Time PARALLELLOADIoPXELCLK. =| Sits ft | | 
tsus [Setup Time SERIAL INtoPELGLK Sid ts 0 
tsys [Setup Time DATAToPARALLELLOAD———~YSCSCS*dtSCa to | Pd 
isyy__[SetupTimeDATAwOWRSSCiSSSCSC*iSts a PP 
teue [Setup Time ENABLE Actve to PARALLELLOAD |_| 12 | 0 | | [| [ns 
i [Hold Time HOLDtopMELCLK——|SSC*dCe fe | td 
tue [Hold Time OUTPUT CONTROLtoPMELGLK | +10 fos | [| ns 


iia [Hold Time SERIAL INToPIMELCLK——SSSPSSSC*dCt a | Pi 
ius |Hold Time DATAToPARALLELLOAD——S«d|~SCSCiY S| | | di 
tue [Hold Time ENABLE inactive toPARALLELLOAD [| _0 | 0 | || [ns 
tw7___|HoldTimeDATAtOWR —SSSC*PSCSC“‘“‘“(RCNC | TS 
tue [Hold Time ENABLE Active toPARALLELLOAD [| 0 | 0 | || [ns 


Note 1: See timing waveforms for relevant signal edges (positive or negative) from which all setup measurements are made. 


Note 2: This is the time that a write operation on the WR input must precede a read operation on the PARALLEL LOAD input when in the one word FIFO mode 


(edge triggered flip flop inputs) and when in the FIFO mode and the FIFO is empty. 


Note 3: Numbers in parenthesis are guaranteed when using a negative ECL supply with a Vee max of —4.75V or when using a positive ECL supply with a 


Voci/Voco min of 4.75V. 
Note 4: Human body model; 120 picofarads thru 1.5 kn. 
Note 5: Ali measurements are made WRT 1.3V level on TTL signals and Voco — 1.3V on ECL signals. 
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DC Electrical Characteristics 
TTL INPUTS AND SUPPLY (Note 3) 


symbol | Parameter | Conditions| Min. | Typ Units 
Vic___|_InputGlamp Voltage | Voo=Min,y= -18maA_ | | | =18 | 
Vin | High LevelinputVottage | Voo=Mex | 
Vu | _LowLevelinputVottage | Voc=Mex | 
im | High Levelinput Current | Voc=Max,Vi=27V | 
} |_MaxtHigh input Current | Voo=Max.Via=7V | | 
m____ | LowLevelinput Current | Voo=Max,vi=04v | 
loco | TTLSupplyCurrent | Voo=Max | 


ECL INPUTS/OUTPUTS DP8516/DP85 16-350 (100K) Vee = —4.2V Output Load = 502 to —2V (Note 2) 


Symbol_| ___—Parameter_——| Conditions| Min | Typ 
Vou __| OutputHigh Vottage | (Notest.2) |= 1025 | -950 | 880 | mv 
Vor.___| OutputLowVottage | (Notes1,2) | 1810 | -1700_| -1620 | mv 
Min | _inputHigh Vottage | (Notes ,2) | 865 | | 880 | mv 
Vi____|_ImputLow Voltage | (Notest.2) | | |= 875 |v 


VeB Bias Output (Notes 1, 2) 4465 4300 4175 ay 
Isink/IsoURCE < 1MA 


< 
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lH High Level Input Current Vin = Vin(max) ae hae 100 pA 
n Low Level Input Current | Vin = Vit (min) | -10 | | 100 pA 


ECL INPUTS/OUTPUTS DP8515/DP85 15-350 (10K) Veg = —5.2V Output Load = 502 to —2V (Note 2) 


Symbol Conditions | min | Typ_| Max | Units 
0°C 


VoH Output High Voltage — 1000 — 840 mV 
— 960 —810 mV 
— 900 —720 mV 


25°C (Note 2) 
70°C 


VoL Output Low Voltage 0°C — 1870 — 1665 mV 
25°C (Note 2) — 1850 — 1650 mV 
70°C — 1830 — 1625 mV 
ViK Input High Voltage o°c —1145 —840 mV 
25°C (Note 2) —1105 —810 mV 
70°C — 1045 —720 | mV 
VIL Input Low Voltage 0°C — 1490 mV 
25°C (Note 2) — 1475 mV 
70°C — 1450 mV 
Ves Bias Output °C — 1480 — omvV 
25°C (Note 2) — 1465 mV 
70°C — 1440 mV 


Isink/lsouRCE < 1mA 


liq High Level Input Current Vin = Vin(max) as 
Nie Low Level input Current Vin = Vit(min) | -100 | = | 100" pA 


ECL 10K/100K SUPPLIES 


Conditions 


ECL Current Voc = Max., Outputs Open 


Note 1: These ECL 100K specifications are guaranteed over the temperature range 0°C to 70°C. 
Note 2: ECL voltage levels are referenced to Vcco. 
Note 3: TTL voltage levels are referenced to TTL Ground. TTL specifications are guaranteed over the temperature range 0°C to 70°C. 


1-163 


DP8515/DP8515-350/DP85 16/DP8516-350 


AC Electrical Characteristics 


fax |PARALLEL LOAD (read) | FIFO Mode 


fax PARALLEL LOAD (read) | Transparent Mode 
tpp4 PIXEL CLK to SO orS8 | wrt PIXEL CLK POS EDGE 
tppo PIXEL CLK to SO or S8_| wrt PIXEL CLK POS EDGE 


po Min | tye x ly 


DP8515/16| -350 | 
P| | 
(Note 1) | (Note 2) 
| 20 | 30 
Pa fas 


DP8515/16| -350 |DP8515/16 


= 
” 


rw | 

“SEN 
= 
Be 
N 


ns 


Note 1: fax is not tested but is assured by correlation with characterization data. fryax of 350 MHz typical is at room temp. and 5.0V. Typical parts run at 310 MHz 


over temp. and Vcc. 


Note 2: fyyax is not tested but is assured by correlation with characterization data. fyax of 450 MHz typical is at room temp. and 5.0V. Typical parts run at 400 MHz 


over temp. and Vcc. 


Circuit Operation 


When the VSR is powered up, on chip power up reset cir- 
cuitry resets the FIFO write and read pointers. This is only 
necessary if the FIFO mode (MO = 0, M1 = 1) is selected, 
although it is performed on every power up. Although no 
random information in the FIFO is cleared, this is not neces- 
sary since it will be written over. With the FIFO reset, and if 
the FIFO mode is selected, the circuit is now ready for a 
write operation into word number one. If, during the opera- 
tion of the circuit, a reset of the FIFO is desired, this may be 
accomplished by applying a low pulse to the M1 input. To 


When the MO input is low and the M1 input is low, the FIFO 


_ depth is reduced to one word; that is, the DATA inputs be- 


ensure proper internal circuit operation, starting this FIFO | 


“reset” operation while the PARALLEL LOAD input is in a 
HIGH state is NECESSARY. Again, the circuit must be in the 
FIFO mode to do this. 


Writing into the FIFO is accomplished on the positive edge 
of the WR input. When this occurs, data at the input which 
has met the specified setup time will be latched into the first 
word of the FIFO. Up to four write operations may be per- 
formed without a read operation; that is, without a positive 
edge of the PARALLEL LOAD input. If more than four con- 
secutive write operations occur, previously written data will 
be overwritten. The WR input and the PARALLEL LOAD 
input may be asynchronous and writing and reading may 
occur simultaneously. However, when the FIFO is empty, if 
the user wants to read the new data about to be entered, 
then the read operation must occur no sooner than one WR 
to Parallel Load Setup Time after the write operation. If this 
condition is not met, the old data will be read from the FIFO. 


So long as the ENABLE input is low, reading data from the 
FIFO will not be inhibited. Reading data out of the FIFO is 
accomplished on the positive edge of the first PIXEL 
CLOCK following a positive edge of the PARALLEL LOAD 
input which has met the specified setup time. The first word 
read will be the first word written into the FIFO. The sixteen 
bit word will be parailel loaded into the ECL shift register 
and Bit 0 will appear at the SO SERIAL OUTPUT. Subse- 


quent PIXEL CLOCK positive edges will shift the data out of | 


the shift register so long as the HOLD input is low; if this 
input is high the shifting of data by the PIXEL CLOCK will be 
inhibited. With a positive edge of the PARALLEL LOAD in- 


come edge triggered flip flops. With the inputs configured as 
edge triggered flip flops, data which meets the specified 
setup time is accepted on the positive edge of the WR input. 
As is the case in the four word FIFO mode, data will be 
loaded into the ECL shift register on the first positive edge 
of the PIXEL CLOCK following a positive edge of the PAR- 
ALLEL LOAD input. Since the FIFO depth is one word, it is 
essential that each write operation precedes a read opera- 
tion by the WR to Parallel Load Setup Time. Also, in this 
mode, every write operation must be followed by a read 
operation if the user does not want to write over any data. 
All of the operations pertaining to the shift register remain 
the same in this mode as in the four word FIFO mode. 


When the Mo input is high, and the M1 input is at any value, 
the FIFO is disconnected and the data inputs become trans- 
parent latches. In the transparent latch mode information at 
the data inputs is passed through to the shift register paral- 
lel load inputs so long as the WR input is low. When the WR 
input is taken high, the information at the data inputs is 
latched into the input buffers. 


As in the other operating modes, data is loaded into the shift 


- register on the positive edge of the first PIXEL CLOCK fol- 


put occurring every sixteen PIXEL CLOCKS, sixteen bit 


words will be shifted out of the shift register. If more than 
sixteen PIXEL CLOCKS occur before a PARALLEL LOAD 
positive edge, then the information at the SERIAL INPUT 
pin will be shifted out of the register. This input can be used 
to cascade shift registers for longer word lengths. 
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lowing a positive edge of the PARALLEL LOAD input. If the 
WR input remains low, new data is loaded into the shift 
register following every PARALLEL LOAD/PIXEL CLOCK 
positive edge pair providing the data meets the setup time 
with respect to the PARALLEL LOAD input. If the WR input 
returns high after latching in the information at the data in- 
puts, PARALLEL LOAD/PIXEL CLOCK positive edge pairs 
will load the shift register with the latched data. All of the 
operations pertaining to the shift register remain the same in 
this mode as in the other modes. 


The OUTPUT CONTROL and OUTPUT LEVEL CONTROL | 


_ inputs allow the user to manipulate the shift register data if 


he so chooses. With the OUTPUT CONTROL input high, the 
last bits of the shift register, that is the SERIAL OUTPUT, 
SO, and the S8 bit, are prohibited from shifting while all the 
other bits are being shifted by the PIXEL CLOCK. This has 
the effect of performing a basic scrolling function on the 
screen. The OUTPUT LEVEL CONTROL input determines 
to what level the last bit is set. When using the OUTPUT 
CONTROL, even though the eighth bit is held at the output, 
internally, information will be shifted through this bit. 


Circuit Operation (Continued) 


As mentioned above, the HOLD input must be low for the 
PIXEL CLOCK to be able to shift data out of the shift regis- 
ter. A basic zoom feature may be performed by holding the 
HOLD input at a high level. When the HOLD input is held 
high for a certain number of PIXEL CLOCK periods, the 
same information will be present at the shift register output 
for the duration of the hold time. This has the effect of 
“stretching” the information from one pixel to multiple pix- 
els. 


The VSR may be used in systems employing word lengths 
other than 16 bits. For example, in a 32-bit system two 


VSR chips may be cascaded to form a 32-bit word. The SO 
output of the first VSR is fed into the SI input of the second 
to accomplish this. In systems using 8 bit words, the S8 
output may be used in addition to the SO output resulting in 
one VSR being used for two words and cutting in half the 
number of packages required to do the parallel to serial 
conversion. 

Any word length can be used so long as the PIXEL CLOCK 
rate is equal to the PARALLEL LOAD rate times the word 
length. 


Four Word FIFO Mode (MO = 0, M1 = 1) 
inputting four words without reading 
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Four Word FIFO Mode (MO = 0, M1 = 1) 
Word 1 written over 
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Transparent Latch Mode (MO = 1,M1 = X) 
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Circuit Operation (Continueg) 
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Circuit Operation (Continued) 


Shift Register Operation with Output Control 
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FIFO Timing Requirements 
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Circuit Operation (Continued) 
Transparent Latch Timing Requirements 
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Shift Register Timing Requirements 


tsus 


tus 


PL 
ENABLE 
% _ tsu3 
PIXEL CLK 
tha 
SIN XXKK! XX XXXXXXXXKKKAKAKAKA KAKA NAY 
—| f= ue hi, 
a e—tsut sui 

HOLD 


$0,580 XXXXKXXXXXKX 


XXXKXXKKXM) 
—| — teo1/o —-po1/o —>| baz 


FIGURE 9 


1-171 


TL/F/8684-16 


TL/F/8684-9 


0SE-91S8d0/91S8d0/0SE-SIS8d0/SlS8dd 


DP8515/DP8515-350/DP8516/DP85 16-350 


Circuit Operation (Continued) 


Shift Register Timing Requirements 
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Equivalent Input/Output Schematics 


TTL. VCC vcco 


TL/F/8684-10 
FIGURE 11. TTL Inputs 


VEE 


FIGURE 12. ECL Inputs 


VEE ; | | TL/F/8684-~13 
FIGURE 14. DP8515/DP8515-350 10K ECL Output 
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FIGURE 13. Vgp Output 
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FIGURE 15. DP8516/DP8516-350 100K ECL Output 


Connections to the DP8515/16 


Figure 16 is a typical connection diagram for the VSR. Not 
shown in this diagram are the power connections. The data 
inputs, DO—-D15, come from memory which may be Video 
Rams as an example. The clock signals are derived from 
clock generator circuitry such as National Semiconductor’s 
DP8512. The VSR is capable of receiving differential ECL 
inputs. If differential use is not desired, the unused inputs 
may be tied to the Vsp reference which is provided as an 
output on the VSR. Although Figure 76 shows the PIXEL 
CLOCK and PARALLEL LOAD inputs as single ended sig- 
nals, the DP8512 provides these as differential outputs. 


The ECL outputs, SO and S8, are differential. Again, the user 
has the option of using these single ended if he so chooses. 
In systems where words other than 8 or 16 bits in length are 
used, the SI inputs may be fed from the SO outputs of anoth- 
er VSR in order to modify the word length, such as increas- 
ing the length to 32 bits. 


Figure 16 shows the part being continuously enabled by 
having the EN input tied to TTL ground. When this is done 
the clock generator circuitry should be able to inhibit the 
PARALLEL LOAD signal while a screen retrace is in prog- 
ress. Another configuration might be to use the ENABLE 
input to gate the PARALLEL LOAD input. In this mode the 
clock generator circuitry must still ensure that the PARAL- 
LEL LOAD signal has the proper phase relationship with 
respect to the video sync signal. By using the ENABLE sig- 
nal to gate the PARALLEL LOAD signal, the PARALLEL 
LOAD signal may be selectively gated to various video shift 
registers. 


Figure 17 shows how when word lengths other than 32, 16, 
or 8 bits are used, several VSR circuits can be interfaced to 
reduce the total number of shift registers required for a sys- 
tem. In the example shown, three VSR circuits are used to 
shift two 24-bit words. Similar configurations can be used for 
other word lengths. 
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FIGURE 16. Typical Connection to the DP8515/DP8515-350/DP8516/DP85 16-350 
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FIGURE 17. DP8515/DP8515-350/DP8516/DP85 16-350 Interconnection for 24-bit Word 
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DP8520A/DP8521A/DP8522A 


PRELIMINARY 


microCMOS Programmable 


256k/1M/4M Video RAM Controller/Drivers 


General Description | 3 eu 


The DP8520A/21A/22A video RAM controllers provide a 


low cost, single chip interface between video RAM and all 
8-, 16- and 32-bit systems. The DP8520A/21A/22A gener- 
ate all the required access control signal timing for VRAMs. 
An on-chip refresh request clock is used to automatically 
refresh the VRAM array. Refreshes and accesses are arbi- 
trated on chip. If necessary, a WAIT or DTACK output in- 
serts wait states into system access cycles, including burst 
mode accesses. RAS low time during refreshes and RAS 
precharge time after refreshes and back to back accesses 
are guaranteed through the insertion of wait states. Sepa- 
rate on-chip precharge counters for each RAS output can 
be used for memory interleaving to avoid delayed back to 
back accesses because of precharge. An additional feature 
of the DP8522A is two access ports to simplify dual access- 
ing. Arbitration among these ports and refresh is done on 


Features 

@ On chip high precision delay line to guarantee critical 
VRAM access timing parameters | 

@ microCMOS process for low power 

mw High capacitance drivers for RAS, CAS, DT/OE 
VRAM address on chip : : 

mw On chip support for nibble, page and static column | 
VRAMs . . 

m Byte enable signals on chip allow byte writing in a word 
size up to 16 bits with no external logic 

w Selection of controller speeds: 20 MHz and 25 MHz 

m On board Port A/Port B (DP8522A only)/refresh arbitra- 
tion logic 

m Direct interface to all major microprocessors (applica- 
tion notes available) aa 

mw 4 RAS and 4 CAS drivers (the RAS and CAS configura- 


and 
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1.0 iieeducion 


The DP8520A/21A/22A are CMOS Video RAM controllers 
that incorporate many advanced features including the ca- 
pabilities of address latches, refresh counter, refresh clock, 
row, column and refresh address multiplexor, delay line, re- 
fresh/access/VRAM transfer cycle arbitration logic and 
high capacitive drivers. The programmable system interface 
allows any manufacturer's microprocessor or bus to directly 
interface via the DP8520A/21A/22A to VRAM arrays up to 
64 Mbytes in size. 

After power up, the DP8520A/21A/22A must first be pro- 
grammed before accessing the VRAM. The eee is pro- 
grammed through the address bus. 


There are two methods of programming the chip, The first 
method, mode load only, is accomplished by asserting the 
signal mode load, ML. A valid programming selection is pre- 
sented on the row, column, bank and ECAS inputs, then ML 
is negated. When ML is negated, the chip is programmed 
with the valid programming bits on the address bus. 


The second method, chip selected access, is accomplished 
by asserting ML and performing a chip selected access. 
When CS and AREQ are asserted for the access, the chip is 
programmed. During this programming access, the pro- 
gramming bits affecting the wait logic become effective im- 
mediately, allowing the access to terminate. After the ac- 
cess, ML is negated and the rest of the programming bits 
take effect. 

Once the DP8520A/21A/22A has been programmed, a 
60 ms initialization period is entered. During this time, the 
DP8520A/21A/22A controllers perform refreshes to. the 
VRAM array so further VRAM warm up cycles are unneces- 
sary. a ~ 

The DP8520A/21A/22A can now be used to access the 
VRAM. There are two modes of accessing with the control- 
ler. The two modes are Mode 0, which initiates RAS syn- 
chronously, and Mode 1, which initiates RAS asynchronous- 


ily. 


To access the VRAM using Mode 0, the igtial ALE is as- 
serted along with CS to ensure a valid VRAM access. ALE 
asserting sets an internal latch and only needs to be pulsed 
and not held throughout the entire access. On the next ris- 
ing clock edge, after the latch is set, RAS will be asserted 
for that access. The DP8520A/21A/22A will place. the row 
address on the VRAM address bus, guarantee the pro- 
grammed value of row address hold time of the VRAM, 
place the column address on the VRAM address bus, guar- 
antee the programmed value of column address setup time 
and assert.CAS. AREQ can be asserted anytime, after the 
clock edge which starts the access RAS. RAS and CAS will 
extend until AREQ is negated. 


The other access mode, Mode 1, is asynchronous to the 
clock. When ADS is asserted, RAS is asserted. The 
DP8520A/21A/22A’ will place the row address on the 
VRAM address bus, guarantee the programmed value of 
row address hold time, place the column address on the 
VRAM address bus, guarantee the programmed value of 
column address setup time and assert CAS. AREQ can be 
tied to ADS or can be asserted after ADS is asserted. AREQ 
negated will terminate the access. 


The DP8520A/21A/22A also provides full support for 
VRAM transfer cycles. To begin the cycle, the input 
AVSRLRQ, Advanced Video Shift Register Load Request, is 
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asserted and must precede the input VSRL, Video. Shift 
Register Load, asserting by enough CLK periods to guaran- 
tee any access in progress or pending refresh can finish. 
VSRL asserting causes DT/OE to transition low immediate- 
ly. Both VSRL and DT/OE assert before RAS and CAS as- 
sert for the transfer. The cycle is ended by DT/OE negating. 
This is caused by either VSRL negating or by four rising 
edges of CLK from VSRL asserting, whichever comes first. 


The DP8520A/21A/22A have greatly expanded refresh ca- 
pabilities compared to other. VRAM controllers. There are 
three modes of refreshing available. These modes are inter- 
nal automatic refreshing, externally controlled/burst re- 
freshing, and refresh request/acknowledge refreshing. Any 
of these modes can be used together or separately to 
achieve the desired results. In any combination of these 
modes, the programming of ECASO determines the use of 
the RFIP (RFRQ) pin. ECASO asserted during programming 
causes this pin to function as RFIP which will assert just 
prior to a refresh cycle and will negate when the refresh is 
completed. ECASO negated during programming causes 
this pin to function as RFRQ which indicates an internal 
refresh request when asserted. 


When using internal automatic refreshing, the DP8520A/ 
21A/22A will generate an internal refresh request from the 
refresh request clock. The DP8520A/21A/22A will arbitrate 
between the refresh requests and accesses. Assuming an 
access is not currently in progress, the DP8520A/21A/22A 
will grant a refresh, assert RFIP if programmed, and on the 
next positive clock edge, refreshing will begin. If an access 
had been in progress, the refresh will begin after the access 
has terminated. 


To use externally controlled/ silts refresh, the user disables 
the internal refresh request by asserting the input 
DISRFRSH. A refresh can now be externally requested by 
asserting the input RFSH. The DP8520A/21A/22A will arbi- 
trate between the external refresh request and accesses. 
Assuming an access is not currently in progress, the 
DP8520A/21A/22A will grant a refresh, assert RFIP if pro- 
grammed, and on the next positive clock edge, refreshing 
will begin. If an access had been in progress, the refresh 
would take place after the access has terminated. 


With refresh request/acknowledge mode, the DP8520A/ 
21A/22A broadcasts the internal refresh request to the sys- 
tem through the RFRQ output pin. External circuitry can de- 
termine when to refresh the VRAM through the RFSH input. 


The controllers have three types of refreshing available: 
conventional, staggered and error scrubbing. Any refresh 
control mode can be used with any type of refresh. In a 
conventional refresh, all of the RAS outputs will be asserted 
and negated at once. In a staggered refresh, the RAS out- 
puts will be asserted one positive clock edge apart. Error 
scrubbing is the same as conventional refresh except that a 
CAS will be asserted during a refresh allowing the system to 
run that data through an EDAC chip and write it back to 
memory, if a single bit error has occurred. The refreshes 
can be extended with the EXTEND REFRESH input, 
EXTNDRF. 

The DP8520A/21A/22A have wait support available as 
DTACK or WAIT. Both are programmable. DTACK, Data 
Transfer ACKnowledge, is useful for processors whose wait 
signal is active high. WAIT is useful for processors whose 
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wait signal is active low. The user can choose either at pro- 
gramming. These signals are used by the on-chip arbitor to 
insert wait states to guarantee the arbitration between ac- 
cesses and refreshes or precharge. Both signals are inde- 
pendent of the access mode chosen. 


DTACK will assert a programmed number of clock edges 
from the event that starts the access RAS. DTACK will be 
negated, when the access is terminated, by AREQ being 
negated. DTACK can also be programmed to toggle with 
the ECAS inputs during burst/page mode accesses. 


WAIT is asserted during the start of the access (ALE and 
CS, or ADS and CS) and will negate a number of clock 
edges from the event that starts the access RAS. After 
WAIT is negated, it will stay negated until the next access. 
WAIT can also be programmed to toggle with ECAS inputs 
during a burst/page mode access. 


Both signals can be dynamically delayed further through the 
WAITIN signal to the DP8520A/21A/22A. 


The DP8520A/21A/22A have address latches, used to 
latch the bank, row and column address inputs. Once the 
address is latched, a column increment feature can be used 
to increment the column address. The address latches can 
also be programmed to be fall through. 


The RAS and CAS drivers can be configured to drive a one, 
two or four bank memory array up to 32 bits in width. The 
two ECAS signals can then be used to select one pair of 
CAS drivers for byte writing with no external logic for sys- 
tems with a word length of up to 16 bits. 


When configuring the DP8520A/21A/22A for more than 
one bank, memory interleaving can be used. By tying the 
low order address bits to the bank select lines, BO and B1, 
sequential back to back accesses will not be delayed since 
the DP8520A/21A/22A have separate precharge counters 
per bank. The DP8520A/21A/22A are capable of perform- 
ing address pipelining. In address pipelining, the DP8520A/ 
21A/22A guarantee the column address hold time and 
switch the internal multiplexor to place the row address on 
the address bus. At this time, another memory access to 
another bank can be initiated. 
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The DP8522A has all the features previously mentioned. 
Unlike the DP8520A/21A, the DP8522A has a second port 
to allow a second CPU to access the memory array. This 
port, Port B, has two control signals to allow a CPU to ac- 
cess the VRAM array. These signals are access request for 
Port B, AREQB, and Advanced Transfer ACKnowledge for 
Port B, ATACKB. Two other signals are used by both Port A 
and Port B for dual accessing purposes. The signals are 
lock, LOCK and grant Port B, GRANTB. All arbitration for 
the two ports and refresh is done on-chip by the DP8522A 
through the insertion of wait states. Since the DP8522A has 
only one input address bus, the address lines have to be 
multiplexed externally. The signal GRANTB can be used for 
this purpose since it is asserted when Port B has access to 
the VRAM array and negated when Port A has access to the 
VRAM array. Once a port has access to the array, the other 
port can be “locked out” by asserting the input LOCK. 
AREQB, when asserted, is used by Port B to request an 
access. ATACKB, when asserted, signifies that access RAS 
has been asserted for the requested Port B access. By us- 
ing ATACKB, the user can generate an appropriate WAIT or 
DTACK like signal for the Port B CPU. 

The following explains the terminology used in this data 
sheet. The terms negated and asserted are used. Asserted 
refers to a “true” signal. Thus, “ECASO asserted” means 
the ECASO input is at a logic 0. The term “‘COLINC assert- 
ed” means the COLINC input is at a logic 1. The term negat- 
ed refers to a ‘“‘false’”’ signal. Thus, “ECASO negated” 
means the ECASO input is at a logic 1. The term “COLINC 
negated” means the input COLINC is at a logic 0. The table 
shown below clarifies this terminology. 


| Signa | Action | Logic Level__ 
rete it | Nees [tow 
|_ActveLow | Asserted [| Low 
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Connection Diagrams (continued) 
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DP8520A/DP8521A/DP8522A 


2.0 Signal Descriptions 


Input/ 
Output 


Device (If not 
applicable to all) 


Description 


2.1 ADDRESS, R/W AND PROGRAMMING SIGNALS 


RO-10 DP8522A 

RO-9 DP8520A/21A 

CO-10 DP8522A 

COo-9 DP8520A/21A 

: 7 

_ —_ 

COLINC 

(EXTNDRF) 

ie ae 
2.2 VRAM CONTROL SIGNALS 

QO0-10 DP8522A 

Q0-9 DP8521A 

Q0-8 DP8521A 

7 = 
—_ a 
7 a 


ROW ADDRESS: These inputs are used to specify the row address during an 
access or refresh to the VRAM or for a VRAM transfer cycle. They are also used 
to program the chip when ML is asserted (except R10). 


COLUMN ADDRESS: These inputs are used to specify the column address 
during an access to the VRAM or for a VRAM transfer cycle. They are also used 
to program the chip when ML is asserted (except C10). 


BANK SELECT: Depending on programming, these inputs are used to select a 
group of RAS and CAS outputs to assert during an access. They are also used to 
program the chip when ML is asserted. 


ENABLE CAS: These inputs are used to enable a single or group of CAS outputs 
when asserted. In combination with the BO, B1 and the programming bits, these 
inputs select which CAS output or CAS outputs will assert during an access. 
ECASO must be asserted for either CASO or CAS1 to assert during an access. 
ECAS1 must be asserted for either CAS2 or CAS3 to assert during an access. 
The ECAS signals can also be used to toggle a group of CAS outputs for 
page/nibble mode accesses. They also can be used for byte write operations. If 
ECASO is negated during programming, continuing to assert the ECASO while 
negating AREQ or AREQB during an access, will cause the CAS outputs to be 
extended while the RAS outputs are negated (the ECASn inputs have no effect 
during scrubbing refreshes). 


WRITE ENABLE IN: This input is used to signify a write operation to the VRAM. 
This input asserted will also cause CAS to delay to the next positive clock edge if 
address bit C9 is asserted during programming. 


COLUMN INCREMENT: When the address latches are used, and a refresh is not 
in progress, this input functions as COLINC. Asserting this signal causes the 
column address to be incremented by one. When a refresh is in progress, this 
signal, when asserted, is used to extend the refresh cycle by any number of 
periods of CLK until it is negated. 


MODE LOAD: This input signal, when low, enables the internal programming 
register that stores the programming information. 


VRAM ADDRESS: These outputs are the multiplexed output of the RO-9, 10 
and CO-9, 10 and form the VRAM address bus. These outputs contain the 
refresh address whenever a refresh is in progress. They contain high capacitive 
drivers with 200 series damping resistors. 


ROW ADDRESS STROBES: These outputs are asserted to latch the row 
address contained on the outputs Q0-8, 9, 10 into the VRAM. When a refresh is 
in progress, the RAS outputs are used to latch the refresh row address contained 
on the Q0-8, 9, 10 outputs in the VRAM. These outputs contain high capacitive 
drivers with 200 series damping resistors. 


COLUMN ADDRESS STROBES: These outputs are asserted to latch the 
column address contained on the outputs Q0-8, 9, 10 into the VRAM. These 
outputs have high capacitive drivers with 209 series damping resistors. — 


DATA TRANSFER/OUTPUT ENABLE: This output transitions low before RAS 
goes low and transitions high before RAS goes high during a video RAM shift 
register load operation (see VSRL pin description). During normal write accesses 
this output is held high, and for read accesses this output is asserted after CAS is 
asserted, and is negated after CAS negates. 
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2.0 Signal Descriptions (Continue) 


Pin Device (if not Input/ 
Name applicable to all) | Output 


2.3 REFRESH SIGNALS 


; a 


; pd 


al 


2.4 PORT A ACCESS 


2.5 PORT B ACCESS SIGNALS 


AREGB | DP8522A 
only 

ATACKB DP8522A 
only 


Description 


REFRESH IN PROGRESS or REFRESH REQUEST: When ECAS0 is asserted 
during programming, this output functions as RFIP, and is asserted prior to a 
refresh cycle and is negated when all the RAS outputs are negated for that 
refresh. When ECASO is negated during programming, this output functions as 
RFRQ. When asserted, this pin specifies that 13 ws or 15 ws have passed. If 
DISRFSH is negated, the DP8520A/21A/22A will perform an internal refresh as 
soon as possible. If DISRFRSH is asserted, RFRQ can be used to externally 
request a refresh through the input RFSH. This output has a high capacitive 
driver and a 202 series damping resistor. 


REFRESH: This input asserted with DISRFRSH already asserted will request a 
refresh. If this input is continually asserted, the DP8520A/21A/22A will perform 
refresh cycles in a burst refresh fashion until the input is negated. If RFSH is 
asserted with DISRFSH negated, the internal refresh address counter is cleared 
(useful for burst refreshes). 


DISABLE REFRESH: This input is used to disable internal refreshes and must 
be asserted when using RFSH for externally requested refreshes. 


ADDRESS STROBE or ADDRESS LATCH ENABLE: Depending on 
programming, this input can function as ADS or ALE. In mode 0, the input 
functions as ALE and when asserted along with CS causes an internal latch to 
be set. Once this latch is set an access will start from the positive clock edge of 
CLK as soon as possible. In Mode 1, the input functions as ADS and when 
asserted along with CS, causes the access RAS to assert if no other event is 
taking place. If an event is taking place, RAS will be asserted from the positive 
edge of CLK as soon as possible. In both cases, the low going edge of this signal 
latches the bank, row and column address if programmed to do so. 


CHIP SELECT: This input signal must be asserted to enable a Port A access. 


ACCESS REQUEST: This input signal in Mode 0 must be asserted some time 
after the first positive clock edge after ALE has been asserted. When this signal 
is negated, RAS is negated for the access. In Mode 1, this signal must be 
asserted before ADS can be negated. When this signal is negated, RAS is 
negated for the access. 7 


WAIT or DTACK: This output can be programmed to insert wait states into a 
CPU access cycle. With R7 negated during programming, the output will function 
as a WAIT type output. In this case, the output will be active low to signal a wait 
condition. With R7 asserted during programming, the output will function as 
DTACK. In this case, the output will be negated to signify a wait condition and will 
be asserted to signify the access has taken place. Each of these signals can be 
delayed by a number of positive clock edges or negative clock levels of CLK to 
increase the microprocessor’s access cycle through the insertion of wait states. 


WAIT INCREASE: This input can be used to dynamically increase the number of 
positive clock edges of CLK until DTACK will be asserted or WAIT will be 
negated during a VRAM access. 


PORT B ACCESS REQUEST: This input asserted will latch the row, column and 
bank address if programmed, and requests an access to take place for Port B. If 
the access can take place, RAS will assert immediately. If the access has to be 
delayed, RAS will assert as soon as possible from a positive edge of CLK. 


ADVANCED TRANSFER ACKNOWLEDGE PORT B: This output is asserted 
when the access RAS is asserted for a Port B access. This signal can be used to 
generate the appropriate DTACK or WAIT type signal for Port B’s CPU or bus. 


1-181 


VWecS8d0/VicS8dd/Vv0cS8dd 
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2.0 Signal Descriptions (Continued) 


Device (If not Input/ 
| applicable to all) | Output 
| 2.6 COMMON DUAL PORT SIGNALS 


GRANTB | DP8522A 
only 


LOCK DP8522A 
only 


| 2.7 VRAM TRANSFER CYCLE SIGNALS 


AVSRLRQ | 


Description 


GRANT B: This output indicates which port is currently granted access to the 
VRAM array. When GRANTEB is asserted, Port B has access to the array. When 
GRANTB is negated, Port A has access to the VRAM array. This signal is used to 


multiplex the signals RO-8, 9, 10; CO—8, 9, 10; BO—-1; WIN; LOCK and ECASO-1 
to the DP8522A when using dual accessing. 


LOCK: This input can be used by the currently granted port to “lock out’ the 
other port from the VRAM array by inserting wait states into the locked out port’s 
access cycle until LOCK is negated. 


ADVANCED VIDEO SHIFT REGISTER LOAD REQUEST: This must precede 
the VSRL input going low by the amount of time necessary to guarantee that any 
currently executing access and pending refresh can finish. This input disables 
Port B and refresh requests until four CLK periods after VSRL has transitioned 
low. This input may be held low until the video RAM transfer cycle is completed 
or may be momentarily pulsed low. 


VIDEO SHIFT REGISTER LOAD: This input causes the DT/OE output to 
transition low immediately. Therefore, when executing a video RAM shift register 
load, VSRL transitions low before RAS goes low. The DT/OE output will 
transition high from VSRL going high or four CLK periods (rising clock edges) 


| from VSRL going low, whichever occurs first. VSRL low also disables the WIN 


2.8 POWER SIGNALS AND CAPACITOR INPUT 


2.9 CLOCK INPUTS 


input from affecting the DT/OE logic, until the video shift register load access is 
over. 


POWER: Supply Voltage. 
GROUND: Supply Voltage Reference. 


CAPACITOR: This input is used by the internal PLL for stabilization. The value of 
the ceramic capacitor should be 0.1 wF and should be connected between this 


~ input and ground. 


There are two clock inputs to the DP8520A/21A/ 22A, CLK and DELCLK. These two clocks may both be tied to the same clock input, 
or a may be two separate clocks, running at different frequencies, asynchronous to each other. 


DELCLK 


SYSTEM CLOCK: This input may be in the range of 0 Hz up to 25 MHz. This 
input is generally a constant frequency but it may be controlled externally to 
change frequencies or perhaps be stopped for some arbitrary period of time. — 
This input provides the clock to the internal state machine that arbitrates 
between accesses and refreshes. This clock’s positive edges and negative 
levels are used to extend the WAIT (DTACK) signals. Ths clock is also used as 
the reference for the RAS precharge time and RAS low time during refresh. 

All Port A and Port B accesses are assumed to be synchronous to the system 
clock CLK. 


DELAY LINE CLOCK: The clock input DELCLK, may be in the range of 6 MHz to 
20 MHz and should be a multiple of 2 (i.e., 6, 8, 10, 12, 14, 16, 18, 20 MHz) to 
have the DP8520A/21A/22A switching characteristics hold. If DELCLK is not 
one of the above frequencies the accuracy of the internal delay line will suffer. 
This is because the phase locked loop that generates the delay line assumes an 
input clock frequency of a multiple of 2 MHz. 

For example, if the DELCLK input is at 7 MHz and we choose a divide by 3 
(program bits CO-—2) this will produce 2.333 MHz which is 16.667% off of 2 MHz. 
Therefore, the DP8520A/21A/22A delay line would produce delays that are 
shorter (faster delays) than what is intended. If divide by 4 was chosen the delay 
line would be longer (slower delays) than intended (1.75 MHz instead of 2 MHz). 
(See Section 10 for more information.) 

This clock is also divided to create the internal refresh clock. 
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3.0 Port A Access Modes 


The DP8520A/21A/22A have two general purpose access 
modes. With one of these modes, any microprocessor can 
be interfaced to VRAM. A Port A access to VRAM is initiated 
by two input signals: ADS (ALE) and CS. The access is al- 
ways terminated by one signal: AREQ. These input signals 
should be synchronous to the input clock, CLK. One of 
these access modes is selected at programming through 
the B1 input signal. In both modes, once an access has 
been requested by CS and ADS (ALE), the DP8522A will 
guarantee the following: 


The DP8520A/21A/22A will have the row address valid to 
the VRAMs’ address bus, QO0-8, 9, 10 given that the row 
address setup time to the DP8520A/21A/22A was met; 


The DP8520A/21A/22A will bring the appropriate RAS or 
RASs low; 


The DP8520A/21A/22A will guarantee the minimum row 
address hold time, before switching the internal multiplexor 
to place the column address on the VRAM address bus, 
QO-8, 9, 10; 

The DP8520A/21A/22A will guarantee the minimum col- 
umn address setup time before asserting the appropriate 
CAS or CASs; 


The DP8520A/21A/22A will hold the column address valid 
the minimum specified column address hold time in address 
pipelining mode and will hold the column address valid the 
remainder of the access in non-pipelining mode. 


The chip includes a WIN pin to signify a write operation to 
the DP8520A/21A/22A. When asserted, WIN will cause 
CAS to delay to the next positive clock edge if address bit 
C9 is asserted during programming. When negated, WIN will 
cause the DT/OE output to follow the CAS outputs for a 
read access, if ECASO is negated during programming. WE, 
write enable, must be externally gated from the processor to 
the VRAM as there is no output pin from the WIN input pin 
available on chip. 


3.1 ACCESS MODE 0 


Access Mode 0, shown in Figure 5a, is selected by negating 
the input B1 during programming. This access mode allows 
accesses to VRAM to always be initiated from the positive 
edge of the system input clock, CLK. To initiate a Mode 0 
access, ALE is pulsed high and CS is asserted. Pulsing ALE 
high and asserting CS, sets an internal latch which requests 
an access. If the precharge time from the last access or 


VRAM refresh had been met and a refresh of VRAM, a Port. 


B access, or a VRAM transfer cycle was not in progress, the 
RAS or group of RASs would be initiated from the first posi- 
tive edge of CLK. If a VRAM refresh is in progress or pre- 
charge time is required, the controller will wait until these 
events have taken place and assert RAS on the next posi- 
tive edge of CLK. 
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Sometime after the first positive edge of CLK after ALE and 
CS have been asserted, the input AREQ must be asserted. 
In single port applications, once AREQ has been asserted, 
CS can be negated. Once AREQ is negated, RAS and 
DTACK, if programmed, will be negated. If ECASO is assert- 
ed during programming, CAS will be negated with AREQ. If 
ECASO was negated during programming, a single CAS or 
group of CASs will continue to be asserted after RAS has 
been negated given that the appropriate ECASs inputs were 
asserted as shown in Figure 5b. This allows the VRAM to 
have data present on the data out bus while gaining RAS 
precharge time. ALE can stay asserted several periods of 
CLK. However, ALE must be negated before or during the 
period of CLK in which AREQ is negated. 


When performing address pipelining, the ALE input cannot 
be asserted to start another access until AREQ has been 
asserted for at least one clock period of CLK for the present 
access. 


3.2 ACCESS MODE 1 


Access Mode 1, shown in Figure 6a, is selected by asserting 
the input B1 during programming. This mode allows access- 
es, which are not delayed by precharge, Port B access, 
VRAM transfer cycle or refresh, to start immediately from 
the access request input, ADS. To initiate a Mode 1 access, 
CS is asserted followed by ADS asserted. If the pro- 
grammed precharge time from the last access or VRAM 
refresh had been met and a refresh of the VRAM, a Port B 
access to the VRAM, or a VRAM transfer cycle was not in 
progress, the RAS or group of RASs selected by program- 
ming and the bank select inputs would be asserted from 
ADS being asserted. If a VRAM refresh, a Port B access, or 
a VRAM transfer cycle is in progress or precharge time is 
required, the controller will wait until these events have tak- 
en place and assert RAS or the group of RASs from the 
next positive edge of CLK. 


When ADS is asserted or sometime after, AREQ must be 
asserted. At this time, ADS can be negated and AREQ will 
continue the access. Once AREQ is negated, RAS and 
DTACK, if programmed, will be negated. If ECASO was as- 
serted during programming, CAS will be negated with 
AREQ. If ECASO was negated during programming, a single 
CAS or group of CASs will continue to be asserted after 
RAS has been negated given that the appropriate ECAS 
inputs were asserted as shown in Figure 6b. This allows a 
VRAM to have data present on the data out bus while gain- 
ing RAS precharge time. ADS can continue to be asserted 
after AREQ has been asserted and negated, however a new 
access would not be started until ADS is negated and as- 
serted again. ADS and AREQ can be tied together in appli- 
cations not using address pipelining. 


If address pipelining is programmed, it is possible for ADS to 
be negated after AREQ is asserted. Once AREQ is assert- 
ed, ADS can be asserted again to initiate a new access. 
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3.0 Port A Access Modes (continued) 
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3.0 Port A Access Modes (continued) 
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DP8520A/DP8521A/DP8522A 


4.0 Refresh Options 


The DP8520A/21A/22A support a wide variety of refresh 
control mode options including automatic internally con- 
trolled refresh, externally controlled/burst refresh, refresh 
request/acknowledge and any combination of the above. 
With each of the control modes above, different types of 
refreshes can be performed. These different types include 
all RAS refresh, staggered refresh and error scrubbing dur- 
ing all RAS refresh. 


There are three inputs, EXTNDRF, RFSH and DISRFSH, 
and one output, RFIP (RFRQ), associated with refresh. 
There are also ten programming bits; RO-1, R9, CO—6 and 
ECASO used to program the various types of refreshing. 


The two inputs, RFSH and DISRFSH, are used in the exter- 
nally controlled/burst refresh mode and the refresh re- 
quest/acknowledge mode. The output RFRQ is used in the 
refresh request/acknowledge mode. The input EXTNDRF is 
used in all refresh modes and the output RFIP is used in all 
refresh modes except the refresh request/acknowledge 
mode. Asserting the input EXTNDRF, extends the refresh 
cycle single or multiple integral clock periods of CLK. The 
output RFIP is asserted one period of CLK before the first 
refresh RAS is asserted. If an access is currently in prog- 
ress, RFIP will be asserted up to one period of CLK before 
the first refresh RAS, once AREQ or AREQB is negated for 
the access (see Figure 7a). 


The DP8520A/21A/22A will increment the refresh address 
counter automatically, independent of the refresh mode 
used. The refresh address counter will be incremented once 
all the refresh RASs have been negated. 


In every combination of refresh control mode and refresh 
type, the DP8520A/21A/22A is programmed to keep RAS 
asserted a number of CLK periods. The values of RAS low 
time during refresh are programmed with the programming 
bits RO and R1. -_ 


4.1 REFRESH CONTROL MODES 


There are three different modes of refresh control. Any of | 


these modes can be used in combination or singularly to 
produce the desired refresh results. The three different 
modes of control are: automatic internal refresh, external/ 
burst refresh and refresh request/acknowledge. 


4.1.1 Automatic Internal Refresh — 


’ The DP8520A/21A/22A have an internal refresh clock. The 


period of the refresh clock is generated from the program- 
ming bits CO-—3. Every period of the refresh clock, an inter- 
nal refresh request is generated. As long as a VRAM access 
is not currently in progress and precharge time has been 
met, the internal refresh request will generate an automatic 
internal refresh. If a VRAM access is in progress, the 
DP8520A/21A/22A on-chip arbitration logic will wait until 
the access is finished before performing the refresh. The 
refresh/access arbitration logic can insert a refresh cycle 
between two address pipelined accesses. However, the re- 
fresh arbitration logic can not interrupt an access cycle to 
perform a refresh. To enable automatic internally controlled 
refreshes, the input DISRFSH must be negated. 


4.1.2 Externally Controlled/Burst Refresh 


To use externally controlled/burst refresh, the user must 
disable the automatic internally controlled refreshes by as- 
serting the input DISRFSH. The user is responsible for gen- 
erating the refresh request by asserting the input RFSH. 


Pulsing RFSH low, sets an internal latch, that is used to 
produce the internal refresh request. The refresh cycle will 
take place on the next positive edge of CLK as shown in 
Figure 7b. \f an access to VRAM is in progress or precharge 
time for the last access has not been met, the refresh will be 


‘delayed. Since pulsing RFSH low sets a latch, the user does 


not have to keep RFSH low until the refresh starts. When 
the last refresh RAS negates, the internal refresh request 
latch is cleared. 


By keeping RFSH asserted past the positive edge of CLK 
which ends the refresh cycle as shown in Figure 8, the user 
will perform another refresh cycle. Using this technique, the 
user can perform a burst refresh consisting of any number 
of refresh cycles. Each refresh cycle during a burst refresh 
will meet the refresh RAS low time and the RAS precharge 
time (programming bits RO—1). 

If the user desires to burst refresh the entire VRAM (all row 
addresses) he could generate an end of count signal (burst 
refresh finished) by looking at one of the DP8520A/21A/ 
22A high address outputs (Q7, Q8, Q9 or Q10) and the RFIP 
output. The Qn outputs function as a decode of how many 
row addresses have been refreshed (Q7 = 128 refreshes, 
Q8 = 256 refreshes, Q9 = 512 refreshes, Q10 = 1024 
refreshes). 


4.1.3 Refresh Request/Acknowledge | 


The DP8520A/21A/22A can be programmed to output in- 
ternal refresh requests. When the user programs ECASO 
negated during programming, the RFIP output functions as 
RFRQ. RFRQ will be asserted from a positive edge of CLK 
as shown in Figure 9a. Once RFRQ is asserted, it will stay 
asserted until the RFSH is pulsed low with DISRFSH assert- 


. ed. This will cause an externally requested/burst refresh to 


take place. If DISRFSH is negated, an automatic internal 
refresh will take place as shown in Figure 9b. 


RFRQ will go high and then assert if additional periods of 
the internal refresh clock have expired and neither an exter- 
nally controlled refresh nor an automatically controlled inter- 
nal refresh have taken place as shown in Figure 9c. \f a time 
critical event, or long access like page/static column mode 
access can not be interrupted, RFRQ pulsing high can be 
used to increment a counter. The counter can be used to 
perform a burst refresh of the number of refreshes missed 


_. (through the RFSH input). 
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4.2 REFRESH CYCLE TYPES 


Three different types of refresh cycles are available for use. 
The three different types are mutually exclusive and can be 
used with any of the three modes of refresh control. The 
three different refresh cycle types are: all RAS refresh, stag- 
gered RAS refresh and error scrubbing during all RAS re- 
fresh. In all refresh cycle types, the RAS precharge time is 
guaranteed: between the previous access RAS ending and 
the refresh RASO starting; between refresh RAS3 ending 
and access RAS beginning; between burst refresh RASs. 


4.2.1 Conventional RAS Refresh 


A conventional refresh cycle causes RASO-3 to all assert 
from the first positive edge of CLK after RFIP is asserted as 
shown in Figure 10. RASO-3 will stay asserted until the 
number of positive edges of CLK programmed have passed. 
On the last positive edge, RASO-3, and RFIP will be negat- 
ed. This type of refresh cycle is programmed by negating 
address bit R9 during programming. 


4.0 Refresh Options (continued) 


RFRO*RFSH*RFIP 


ACIP*RFRQ 
TL/F/9338-10 


Explanation of Terms 


RFRQ = _ ReFresh ReQuest internal to the DP8520A/21A/22A. 
RFRQ has the ability to hold off a pending access. 


RFSH = _ Externally requested ReFreSH 
RFIP =  ReFresh In Progress 


ACIP = __— Port A or Port B (DP8522A only) ACcess In Progress. 
This means that either RAS is low for an access or is 
in the process of transitioning low for an access. 


FIGURE 7a. DP8520A/21A/22A Access/Refresh Arbitration State Program 


V22S8d0/Vi2S8dd/Vv0cssdd 


CLK ty ty , 
DISRFSH 


RFSH 


RFP [ 
(if programmed) 
RAS fi 


Q0-8,9,10 REFRESH ROW 


FIGURE 7b. Single External Refresh (2 Periods of RAS Low during Refresh Programmed) 
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4.0 Refresh Options (Continued) 
CLK : | 
ars SS SAY as 
7 | ET a 
RAS i aaa: oe ee PRECHARGE 


Q0-8,9,10 TITTTITITITICT ITLL TTT ReRESH ROW OSCR ROWS) eRe ROWS 2 0 0 
TL/F/9338-12 
FIGURE 8. External Burst Refresh (2 Periods of RAS Precharge, 
2 Periods of Refresh RAS Low during Refresh Programmed) 
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. | TL/F/9338-13 
FIGURE 9a. Externally Controlled Single and Burst Refresh with Refresh Request (RFRQ) Output 
(2 Periods of RAS Low during Refresh Programmed) 
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TL/F/9338—14 
FIGURE 9b. Automatic Internal Refresh with Refresh Request (3T of RAS low during refresh programmed) 


*In Figures 9a and 9b, where the DP8520A/21A/22A operate in the refresh request/acknowledge mode, the RFIP output pin functions as RFRQ. 
An RFIP timing waveform is included in the figure solely for the purpose of simplifying the diagrams. 


1-188 


68L-1L 


CLK 
. 13 or 15nsc ~500 ns—| 
RFRQ 


RFSH 


DISRFSH 


FIGURE 9c. Refresh Request Timing 
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FIGURE 10. Conventional RAS Refresh 
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FIGURE 11. Staggered RAS Refresh 
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4.0 Refresh Options (Continue) 
4.2.2 Staggered RAS Refresh 


A staggered refresh staggers each RAS or group of RASs | 
_ by a positive edge of CLK as shown in Figure 77. The num- 


ber of RASs, which will be asserted on each positive edge 
of CLK, is determined by the RAS, CAS configuration mode 
programming bits C4—C6. If single RAS outputs are select- 
ed during programming, then each RAS will assert on suc- 
cessive positive edges of CLK. If two RAS outputs are se- 
lected during programming then RASO and RAS1 will assert 
on the first positive edge of CLK after RFIP is asserted. 
RAS2 and RAS3 will assert on the second positive edge of 
CLK after RFIP is asserted. If all RAS outputs were selected 
during programming, all RAS outputs would assert on the 
first positive edge of CLK after RFIP is asserted. Each RAS 
or group of RASs will meet the programmed RAS low time 
and then negate. 


4.2.3 Error Scrubbing during Refresh 

The DP8520A/21A/22A support error scrubbing during all 
RAS VRAM refreshes. Error scrubbing during refresh is se- 
lected through bits C4—C6 with bit R9 negated during pro- 
gramming. Error scrubbing can not be used with staggered 
refresh (see Section 9.0). Error scrubbing during refresh al- 


lows a CAS or group of CASs to assert during the all RAS 
refresh as shown in Figure 12. This allows data to be read 
from the VRAM array and passed through an Error Detec- 
tion And Correction Chip, EDAC. It is important to note that 
while an error scrubbing during refresh access is being pe- 
formed, it is the system designer’s responsibility to properly 
control the WE input of the VRAM. WE should be high dur- 
ing the initial access of the VRAM, which could be accom- 
plished by gating RFIP, if programmed, with the processor 
access circuitry that creates WE. If the EDAC determines 
that the data contains a single bit error and corrects that 
error, the refresh cycle can be extended with the input ex- 
tend refresh, EXTNDRF, and a read-modify-write operation 


_can be performed, and the corrected data can be written 


back to the VRAM by'bringing WE low. The DP8522A has a 
24-bit internal refresh address counter that contains the 11 
row, 11 column and 2 bank addresses. The DP8520A/21A 
have a 22-bit internal refresh address counter that contains 
the 10 row, 10 column and 2 bank addresses. These coun- 
ters are configured as bank, column, row with the row ad- 
dress as the least significant bits. The bank counter bits are 
then used with the programming selection to determine 
which CAS or group of CASs will assert during a refresh. 


DRAM ADDRESS | XREFRESH ROW REFRESH COLUMN REFRESH COLUMN ACCESS 


RASO 
CASO 


RAS1 


RAS2 

-CAS2 

CAS3 

DRAM DATA 

WE 

(VRAM input) 

SINGLE BiT VRAM ERROR 


EXTNDRF | 


DATA W/ERROR CORRECTED DATA 
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FIGURE 12. Error Scrubbing during Refresh 
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4.0 Refresh Options (continued) 


4.3 EXTENDING REFRESH 


The programmed number of periods of CLK that refresh 
RASs are asserted can be extended by one or multiple peri- 
ods of CLK. Only the all RAS (with or without error scrub- 
bing) type of refresh can be extended. To extend a refresh 
cycle, the input extend refresh, EXTNDRF, must be assert- 
ed before the positive edge of CLK that would have negated 
all the RAS outputs during the refresh cycle and after the 
positive edge of CLK which starts all RAS outputs during the 
refresh as shown in Figure 73. This will extend the refresh to 
the next positive edge of CLK and EXTNDRF will be sam- 
pled again. The refresh cycle will continue until EXTNDRF is 
sampled low on a positive edge of CLK. 


4.4 CLEARING THE REFRESH ADDRESS COUNTER 


The refresh address counter can be cleared by asserting 
RFSH while DISRFSH is negated as shown in Figure 14a. 
This can be used prior to a burst refresh of the entire memo- 


ry array. By asserting RFSH one period of CLK before 


DISRFSH is asserted and then keeping both inputs assert- 


ed, the DP8520A/21A/22A will clear the refresh address 
counter and then perform refresh cycles separated by the 
programmed value of precharge as shown in Figure 14b. An 
end-of-count signal can be generated from the Q VRAM 
address outputs of the DP8520A/21A/22A and used to ne- 
gate RFSH. 


4.5 CLEARING THE REFRESH REQUEST CLOCK 


The refresh request clock can be cleared by negating 
DISRFSH and asserting RFSH for 500 ns, one period of the 
internal 2 MHz clock as shown in Figure 75. By clearing the 
refresh request clock, the user is guaranteed that an inter- 
nal refresh request will not be generated for approximately 
15 ps, one refresh clock period, from the time RFSH is neg- 
ated. This action will also clear the refresh address counter. 


VeeS8d0/VicS8dd/Vv0cS8dd 


EDGE OF CLK 
WHICH SHOULD 
HAVE ENDED REFRESH 


CLK 


RFIP 
(if programmed) 


RASO=3 


EXTNDRF S t 


TL/F/9338-19 
FIGURE 13. Extending Refresh with the Extend Refresh (EXTNDRF) Input 
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FIGURE 14a. Clearing the Refresh Address Counter 
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FIGURE 14b. Clearing the Refresh Counter during Burst 
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FIGURE 15. Clearing the Refresh Request Clock Counter 
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5.0 Port A Wait State Support 


Wait states allow a CPU’s access cycle to be increased by 
one or multiple CPU clock periods. By increasing the CPU’s 
access cycle, all signals associated with that access cycle 
are extended. The wait or ready input is named differently 
by CPU manufacturers. However, any CPU’s wait or ready 
input is compatible with either the WAIT or DTACK output of 
the DP8520A/21A/22A. The CPU samples a wait or ready 
line to determine if another clock period should be inserted 
into the access cycle. If another clock period is inserted, the 
CPU will continue to sarnple the input every CPU clock peri- 
od until the input signal changes polarity, allowing the CPU 
access cycle to terminate. The user determines whether to 
program WAIT or DTACK (R7) and which value to select for 
WAIT or DTACK (R2, R3) depending upon the CPU used 
and where the CPU samples its wait input during an access 
cycle. 


The decision to terminate the CPU access cycle is directly 
affected by the speed of the VRAMs used. The system de- 
signer must ensure that the data from the VRAMs will be 
present for the CPU to sample or that the data has been 
written to the VRAM before allowing the CPU access cycle 
to terminate. 


The insertion of wait states also allows a CPU’s access cy- 
cle to be extended until the VRAM access has taken place. 
The DP8520A/21A/22A insert wait states into CPU access 
cycles due to; guaranteeing precharge time, refresh current- 
ly in progress, user programmed wait states, the WAITIN 
signal being asserted and GRANTB not being valid 
(DP8522A only). If one of these events is taking place and 
the CPU starts an access, the DP8520A/21A/22A will insert 
wait states into the access cycle, thereby increasing the 
length of the CPU’s access. Once the event has been com- 
pleted, the DP8520A/21A/22A will allow the access to take 
place and stop inserting wait states. 

There are six programming bits, R2-R7; an input, WAITIN; 
and an output that functions as WAIT or DTACK. 


5.1 WAIT TYPE OUTPUT 


With the R7 address bit negated during programming, the 
user selects the WAIT output. As long as WAIT is sampled 
asserted by the CPU, wait states (extra clock periods) are 
inserted into the current access cycle as shown in Figure 
76. Once WAIT is sampled negated, the access cycle is 
completed by the CPU. WAIT is asserted at the beginning of 
a chip selected access and is programmed to negate a 
number of positive edges and/or negative levels of CLK 
from the event that starts the access. WAIT can also be 
programmed to function in page/burst mode applications. 


| T1 


CLK 


WaT 


WAIT SAMPLED 


LOW 


WAITSTATE IN THE 


Once WAIT is negated during an access, and the ECAS 
inputs are negated with AREQ asserted, WAIT can be pro- 
grammed to toggle, following the ECAS inputs. Once AREQ 
is negated, ending the access, WAIT will stay negated until 
the next chip selected access. 


5.1.1. Wait during Single Accesses 


WAIT can be programmed to delay a number of positive 
edges and/or negative levels of CLK. These options are 
programmed through address bits R2 and R3 at program- 
ming time. The user is given four options described below. 


OT during non delayed and delayed acceses: WAIT will stay 
negated during a non-delayed access as shown in Figures 
17a and 17c. During an access that is delayed, WAIT will 
assert at the start of the access (CS and ALE or ADS) and 
negate from the positive edge of CLK that starts RAS for 
that access as shown in Figures 176 and 17d. 


OT during non-delayed accesses and 14T during delayed 
accesses: WAIT will stay negated during a non-delayed ac- 
cess as shown in Figures 18a and 78c. During an access 
that is delayed, WAIT will assert at the start of the access 


CS and ALE or ADS) and negate on the negative level of 


CLK after the positive edge of CLK that asserted RAS for 
that access as shown in Figures 18b and 78d. 


YT during non-delayed and delayed accesses: if mode 0 is 
used, WAIT will assert when ALE is asserted and CS is 
asserted. WAIT will then negate on the negative level of 
CLK after the positive edge of CLK that asserts RAS for the 
access as shown in Figure 19a. In Mode 1, WAIT will assert 
from CS asserted and ADS asserted. WAIT will then negate 
on the negative level of CLK after RAS has been asserted 
for the access as shown in Figure 79c. During delayed ac- 
cesses in both modes, WAIT will assert at the start of the 
access and negate on the negative level of CLK after the 
positive edge of CLK that started RAS for that access as 
shown in Figures 19b and 19d. 


1T during non-delayed and delayed accesses. In Mode 0, 

WATT will assert from ALE asserted and CS asserted. WAIT 
will negate from the next positive edge of CLK that asserts 
RAS for the access as shown in Figure 20a. \n Mode 1, 

WAIT will assert from ADS asserted and CS asserted. WAIT 
will negate from the first positive edge of CLK after ADS and 
CS have been asserted as shown in Figure 20c. During de- 
layed accesses in both modes, WAIT will assert at the be- 
ginning of the access and will negate on the first positive 
edge of CLK after the positive edge of CLK that starts RAS 
for the access as shown in Figures 20b and 20d. 
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FIGURE 16. WAIT Type Output 
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5.0 Port A Wait State Support (Continued) © 
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FIGURE 17a. Mode 0 Non-Delayed Access with WAIT OT (WAIT is Sampled at the End of the “T2” Clock State) 


CLK 


ALE 


RASn — PRECHARGE J 


WAIT 
TL/F/9338—-25 
FIGURE 17b. Mode 0 Delayed Access with WAIT OT 
(“2T” RAS Precharge, WAIT is Sampled at the End of the “T2” Clock State) 
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5.0 Port A Wait State Support (continue) 
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FIGURE 17c. Mode 1 Non-Delayed Access with WAIT OT (WAIT is Sampled at the End of the “T2” Clock State) 
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FIGURE 17d. Mode 1 Delayed Access with WAIT OT (WAIT is Sampled at the End of the “T2” Clock State) 
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5.0 Port A Wait State Support (Continued) 
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FIGURE 18a. Mode 0 Non-Delayed Access with WAIT OT (WAIT is Sampled at the “T3” Falling Clock Edge) 
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FIGURE 18b. Mode 0 Delayed Access with WAIT 14T (WAIT is Sampled at the “T3” Falling Clock Edge) 
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5.0 Port A Wait State Support (continue) 
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FIGURE 18c. Mode 1 Non-Delayed Access with WAIT OT (WAIT is Sampled at the “T3” Falling Clock Edge) 
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FIGURE 18d. Mode 1 Delayed Access with WAIT 1,T (WAIT is Sampled at the “T3” Falling Clock Edge) 
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5.0 Port A Wait State Support (continued) 
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FIGURE 19a. Mode 0 Non-Delayed Access with WAIT 1,T (WAIT is Sampled at the “T2” Falling Clock Edge) 
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FIGURE 19b. Mode 0 Delayed Access with WAIT 14T (WAIT is Sampled at the “T2” Falling Clock Edge) 
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5.0 Port A Wait State Support (Continued) 
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FIGURE 19c. MODE 1 Non-Delayed Access with WAIT 14T (WAIT is Sampled at the “T2” Falling Clock Edge) 
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FIGURE 19d. Mode 1 Delayed Access with WAIT 1,T (WAIT is Sampled at the “T2” Falling Clock Edge) 
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5.0 Port A Wait State Support (Continued) 
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FIGURE 20a. Mode 0 Non-Delayed Access with WAIT 1T (WAIT is Sampied at the End of the “T2” Clock State) 
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FIGURE 20b. Mode 0 Delayed Access with WAIT 1T (WAIT is Sampled at the End of the “T2” Clock State) 
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5.0 Port A Wait State Support (Continued) 


i al 
CLK 


WS 


\\ 


WAIT 


T2 


_ 


! 
{ | 
{ t 
| ' 7, 
! j 


TL/F/9338-38 


FIGURE 20c. Mode 1 Non-Delayed Access with WAIT 1T (WAIT is Sampled at the End of the “T2” Clock State) 
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FIGURE 20d. Mode 1 Delayed Access with WAIT 1T (WAIT is Sampled at the End of the “T2” Clock State) 


When ending WAIT from a negative level of CLK; if RAS is 
asserted while CLK is high then WAIT will negate from the 
negative edge of CLK; if RAS is asserted while CLK is low 
then WAIT will negate from RAS asserting. When ending 
WAIT from a positive edge of CLK in Mode 0, the user can 
think of the positive edge of CLK that starts RAS as OT and 
the next positive edge of CLK as 1T. When. ending WAIT 
from a positive edge of CLK in Mode 1, the positive edge of 
CLK that ADS is setup to can be thought of as 1T in a non- 
delayed access. In a delayed access, the positive edge of 
CLK that starts RAS can be thought of as OT and the next 
positive edge as iT. 
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5.1.2 Wait during Page Burst Accesses 


WAIT can be programmed to function differently during 
page/burst types of accesses. During a page/burst access, 
the ECAS inputs will be asserted then negated while AREQ 
is asserted. Through address bits R4 and R5, WAIT can be 
programmed to assert and negate during this type of ac- 
cess. The user is given four programming options described 
below. 


No Wait States: In this case, WAIT will remain negated even 
if the ECAS inputs are toggled as shown in Figure 27. 


5.0 Port A Wait State Support (Continued) 
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FIGURE 21. No Wait States during Burst (WAIT is Sampled at the End of the ‘‘T3” Clock State) 
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FIGURE 22. OT during Burst (WAIT is Sampled at the End of the “T3” Clock State) 
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FIGURE 23. 14T during Burst Access (WAIT is Sampled at the “T3” Falling Clock Edge) 


OT: WAIT will be asserted when the ECAS inputs are negat- 1%T: WAIT will be asserted when the ECAS inputs are nega- 
ed with AREQ remaining asserted. When one or both of the ted with AREQ remaining asserted. When one or both of the 
ECAS inputs are asserted, WAIT will be negated as shown ECAS inputs are asserted again, WAIT will be negated from 
in Figure 22. the first negative level of CLK after one or both of the 


ECASs are asserted as shown in Figure 23. 
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5.0 Port A Wait State Support (Continued) 


1T: WAIT will be asserted when the ECAS inputs are negat- 
ed with AREQ remaining asserted. When one or both of the 
ECAS inputs are asserted again, WAIT will be negated from 


the first positive edge of CLK after a single ECAS or both of — 


the ECASs are asserted as shown in Figure 24. 


When ending WAIT from a negative level of CLK; if the 
ECASs are asserted while CLK is high then WAIT will ne- 
gate from the negative edge of CLK, if the ECASs are as- 
serted while CLK is low then WAIT will negate from the 
ECASs asserting. When ending WAIT from a positive edge 
of CLK, the positive edge of CLK that ECAS is setup to can 
be thought of as 1T. 


5.2 DTACK TYPE OUTPUT 


With the R7 address bit asserted during programming, the 
user selects the DTACK type output. As long as DTACK is 
sampled negated by the CPU, wait states are inserted into 
the current access cycle as shown in Figure 25. Once 
DTACK is sampled asserted, the access cycle is completed 
by the CPU. DTACK, which is normally negated, is pro- 
grammed to assert a number of positive edges and/or neg- 
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_ FIGURE 24. 1T during Burst Access (WAIT is Sampied at the End of the “T3” Clock State) 


CLK 


DTACK 


ative levels from the event that starts RAS for the access. 
DTACK can also be programmed to function during page/ 
burst mode accesses. Once DTACK is asserted and the 
ECAS inputs are negated with AREO asserted, DTACK can 
be programmed to negate and assert from the ECAS inputs 
toggling to perform a page/burst mode operation. Once 
AREQ is negated, ending the access, DTACK will be negat- 
ed and stays negated until the next chip selected access. 


5.2.1 DTACK during Single Accesses 


DTACK can be programmed to delay a number of positive 
edges and/or negative levels of CLK. These options are 
programmed through address bits R2 and R3 at program- 
ming time. The user is oven four options described by the 
following. 


OT during non-delayed accesses and delayed accesses: in 
Mode 0, DTACK will assert from the positive edge of CLK 
which starts RAS as shown in Figure 26a. In Mode 1, 
DTACK will assert from ADS and CS as shown in Figure 
26c. During delayed accesses in both modes, DTACK will 
assert from the positive edge of CLK which starts RAS for 
the access as shown in Figure 26b and 26d. 
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FIGURE 25. DTACK Type Output 
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5.0 Port A Wait State Support (Continued) 
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TL/F/9338-~45 
FIGURE 26a. Mode 0 Non-Delayed Access with DTACK OT (DTACK is Sampled at the End of the “T2” Clock State) 
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FIGURE 26b. Mode 0 Delayed Access with DTACK OT (2T Clock Periods Are Programmed 
for RAS Precharge, DTACK is Sampled at the End of the “T2” Clock State) 
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FIGURE 26c. Mode 1 Non-Delayed Access with DTACK OT (DTACK is Sampled at the End of the “T2” Clock State) 
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FIGURE 26d. Mode 1 Delayed Access with DTACK OT (DTACK is Sampled at the End of the “T2” Clock State) 
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5.0 Port A Wait State Support (Continued) 


%T during non-delayed and delayed accesses: In Mode 0, ed as shown in Figures 27c and 27d. During delayed ac- 


DTACK will assert on the negative level of CLK after the cesses in both modes, DTACK will assert from the negative 
positive edge of CLK which starts RAS as shown in Figure level of CLK after the positive edge of CLK which starts 
27a. \n Mode 1, DTACK will assert from the negative level of RAS for the access as shown in Figures 27b and 27e. 
CLK after ADS has been asserted given that RAS is assert- 
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FIGURE 27a. Mode 0 Non-Delayed Access with DTACK of 1,T aes is Sampled at the “T3” Falling Clock Edge) 
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FIGURE 27b. Mode 0 Delayed Access with DTACK of 12T (DTACK is Sampled at the “T3” Falling Clock Edge) 
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FIGURE 27c. Mode 1 Non-Delayed Access with DTACK of 14T (OTACK is Sampled at the “T2” Falling Clock Edge) 
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5.0 Port A Wait State Support (Continued) 
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FIGURE 27d. Mode 1 Non-Delayed Access with DTACK of 12T (DTACK is Sampled at the “T2” Falling Clock Edge) 
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FIGURE 27e. Mode 1 Delayed Access with DTACK of 14T (DTACK is Sampled at the “T2” Falling Clock Edge) 
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FIGURE 28a. Mode 0 Non-Delayed Access with DTACK of 1T (DTACK is Sampled at the End of the “T2” Clock State) 


1T during delayed and non-delayed accesses: In Mode 0, positive edge CLK after ADS and CS are asserted as shown 
DTACK will assert from the first positive edge of CLK after in Figures 28¢c and 28d. During delayed accesses in both 
the positive edge of CLK which starts RAS for the access as modes, DTAGK will assert from the first positive edge of 
shown in Figure 28a. In Mode 1, DTACK will assert from the CLK after the positive edge of CLK which starts RAS for the 


access as shown in Figures 28b and 28e. 
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5.0 Port A Wait State Support (Continued) 
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FIGURE 28b. Mode 0 Delayed Access with DTACK of 1T (DTACK is Sampled at the End of the “T2” Clock State) 
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FIGURE 28c. Mode 1 Non-Delayed Access with DTACK of 1T (DTACK is Sampled at the End of the “‘T2” Clock State) 
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FIGURE 28d. Mode 1 Late Non-Delayed Access with DTACK of 1T | 
_. (DTACK is Sampled at the End of the “T2” Clock State) 
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5.0 Port A Wait State Support (Continued) 


ee oe ee 


Tw T3 ™ 


CLK t, 1) 
' ' 
' ' 
a ! | 
ADS ' 


RASn PRECHARGE 


DTACK 


TL/F/9338-64 


FIGURE 28e. Mode 1 Delayed Access with DTACK of 1T (DTACK is Sampled at the End of the “T2” Clock State) 


114T during delayed and non-delayed accesses: In Mode 0, 
DTACK will assert from the negative level after the first posi- 
tive edge of CLK after the positive edge of CLK which starts 
RAS for the access as shown in Figure 29a. In Mode 1, 
DTACK will assert from the negative level after the first posi- 
tive edge of CLK after ADS and CS are asserted as shown 
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NO 


CLK D 


ALE 


DTACK 


in Figures 29c and 29d. During delayed accesses in both 
modes, DTACK will assert from the negative level after the 
first positive edge of CLK after the positive edge of CLK 
which starts RAS for the access as shown in Figures 29b 
and 29e. 
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FIGURE 29a. Mode 0 Non-Delayed Access with DTACK of 114T (DTACK is Sampled at the “T2” Falling Clock Edge) 
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FIGURE 29b. Mode 0 Delayed Access with DTACK of 114T (DTACK is Sampled at the “T2” Falling Clock Edge) 
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5.0 Port A Wait State Support (Continued) 
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FIGURE 29c. Mode 1 Non-Delayed Access with DTACK of 114T (DTACK is Sampled at the “T2” Falling Clock Edge) 


| qm | T2 | Ww | TW | e | 


CLK 


ADS 4 


DTACK 


TL/F/9338-68 
FIGURE 29d. Mode 1 Non-Delayed Access with DTACK of 112T (DTACK is Sampled at the “T2” Falling Clock Edge) 
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FIGURE 29e. Mode 1 Delayed Access with DTACK of 114T (DTACK is Sampled at the “T2” Falling Clock Edge) 
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5.0 Port A Wait State Support (Continued) 


When starting DTACK from a negative level of CLK; if RAS 
is asserted while CLK is high then DTACK will assert from 
the negative edge of CLK, if RAS is asserted while CLK is 
low, then DTACK will assert from RAS asserting. When 
starting DTACK from a positive edge of CLK in Mode 0, the 
positive edge of CLK that starts RAS can be thought of as 
OT. In Mode 1 during non-delayed accesses, the positive 
edge of CLK that ADS is setup to can be thought of as 1T. 
During delayed accesses, the positive edge of CLK that 
starts RAS can be thought of as OT and the next positive 
edge of CLK as 1T. 


5.2.2 DTACK during Page/Burst Accesses 


DTACK can be programmed to function differently during 
page/burst types of accesses. During a page/burst access, 


CLK 


| on ® t 
BEGINNING 


Gees GR 
AREQ access 


RASn 
CAS 


DTACK ee 


the ECAS inputs will be asserted then negated while AREQ 
remains asserted. Through address bits R4 and R5, DTACK 
can be programmed to negate and assert during this type of 
access. The user is given four programming options de- 
scribed below. 

No Wait States: In this case, DTACK will remain asserted 


even if the ECAS inputs are negated with AREQ asserted as 
shown in Figure 30. 

OT: DTACK will be negated when the ECAS inputs are nega- 
ted with AREQ asserted. When one or both ECAS inputs 
are asserted again, DTACK will be asserted as shown in 
Figure 31. 
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FIGURE 30. No Wait States during Burst Access (DTACK is Sampled at the End of the “T3” Clock State) 
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FIGURE 31. OT during Burst Access (DTACK is Sampled at the End of the “T3” Clock State) 
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5.0 Port A Wait State Support (Continued) 


YeT: DTACK will be negated when the ECAS inputs are neg- 
ated with AREQ asserted. When one or both ECAS inputs 
are asserted again, DTACK will be asserted from the first 
negative level of CLK after ECAS is asserted as shown in 
Figure 32. 
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1T: DTACK will be negated when the ECAS inputs are nega- 
ted with AREQ asserted. When one or both ECAS inputs 
are asserted again, DTACK will be asserted from the first 
positive: edge of CLK after Sone is asserted as shown in 
Figure 33. 
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FIGURE 32. 14T during Burst Access (DTACK is Sampled at the “T3” Falling Clock Edge) 
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_ FIGURE 33. 1T during Burst Access (DTACK is Sampled at the “T3” Falling Clock Edge) 
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5.0 Port A Wait State Support (Continued) 


When starting DTACK from a negative level of CLK; if the 
ECASs are asserted while CLK is high then DTACK will as- 
sert from the negative edge of CLK, if the ECASs are assert- 
ed while CLK is low then DTACK will assert from the ECASs 
asserting. When starting DTACK from a positive edge of 
CLK, the positive edge of CLK that ECAS is setup to can be 
thought of as 1T. 


5.3 WAIT STATE SUPPORT FOR VIDEO RAM SHIFT 
REGISTER LOAD OPERATIONS FOR PORT A 

If using the DP8520A/21A/22A in a system using video 
VRAMs, the CPU that controls loading the Video RAM shift 
register must be connected to Port A. The input AVSRLRQ 
asserts, signaling an advanced request for a Video RAM 


shift register load operation. Sometime later, the input VSRL 
asserts, signifying that the transfer cycle has started, and 
this action causes the DT/OE output to transfer low. VSRL 
asserting also asserts WAIT (keeps DTACK negated) and 
will then insert wait states into the transfer cycle. The trans- 
fer cycle is completed from either VSRL negating or four 
clocks from VSRL asserting. The first event of these two to 
take place causes WAIT to negate (DTACK to assert) imme- 
diately or one half system clock period later, depending on 
how the user had programmed WAIT to end (DTACK to 
start) during a non-burst type of access. The wait logic is 
intimately connected to the DP8520A/21A/22A graphics 
functions and the WAIT output functions the same as the 
DT/OE output (see Figure 34). 
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FIGURE 34. Wait State Timing during a VRAM Transfer Cycle (WAIT 
Programmed as OT, WAIT Sampled at the “T3” Rising Clock Edge) 
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5.0 Port A Wait State Support (Continued) 


5.4 DYNAMICALLY INCREASING THE ee OF 
WAIT STATES | 


The user can increase the number of positive edges of CLK 
before DTACK is asserted or WAIT is negated. With the 
input WAITIN asserted, the user can delay DTACK asserting 
or WAIT negating either one or two more positive edges of 
CLK. The number of edges is programmed through address 
bit R6. If the user is increasing the number of positive edges 
in a delay that contains a negative level, the positive edges 
will be met before the negative level. For example if the user 
programmed DTACK of ‘%T, asserting WAITIN, pro- 
grammed as 2T, wouid increase the number of positive edg- 
es resulting in DTACK of 214T as shown in Figure 35a. Simi- 
larly, WAITIN can increase the number of positive edges in 
a page/burst access. WAITIN can be permanently asserted 
in systems requiring an increased number of wait states. 


WAITIN can also be asserted and negated, depending on 


the type of access. As an example, a user could invert the 


WRITE line from the CPU and connect the output to | 


WAITIN. This could be used to perform write accesses with 


1 wait state and read accesses with 2 wait states as shown 
in Figure 35b. 


WAITIN 


ACCESS WITH DTACK OF 47.WITH WAITIN NEGATED 


5.5 GUARANTEEING RAS “oY TIME AND RAS 
PRECHARGE TIME — 


The DP8520A/21A/ 224 will guarantee RAS precharge time 
between accesses; between refreshes; and between ac- 
cess and refreshes. The programming bits RO and Ri are 
used to program combinations of RAS precharge time and 
RAS low time referenced by positive edges of CLK. RAS 
low time is programmed for refreshes only. During an ac- 
cess, the system designer guarantees the time RAS is as- 
serted through the DP8520A/21A/22A wait logic. Since in- 
serting wait states into an access. increases the length of 
the CPU signals which are.used to create ADS or.ALE and 
AREQ, the time that RAS is asserted can be guaranteed.. 


Precharge time is also guaranteed by the DP8520A/21A/ 
22A. Each RAS output has a separate positive edge of CLK 
counter. AREQ is negated setup to a positive edge of CLK 
to terminate the access. That positive edge is 1T. The next 
positive edge is 2T. RAS will not be asserted until the pro- 
grammed number of positive edges of CLK have passed as 
shown in Figure 36. Once the programmed precharge time 
has been met, RAS will be asserted from the positive edge 
of CLK. However, since there isa precharge counter per 
RAS, an access using another RAS will not be delayed. 
Precharge time before a refresh is always referenced from 
the access RAS negating before RASO for the refresh as- 
serting. After a refresh, precharge time is referenced from 
RAS3 negating, for the refresh, to the access RAS assert- 


ing. 


ACCESS WITH DTACK OF 57 WITH WAITIN 
ASSERTED PROGRAMMED AS 2T 
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FIGURE 35a. WAITIN Example (DTACK is Sampled at the “T3” Falling Clock Edge) 
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5.0 Port A Wait State Support (Continued) 


VeeS8d0/V1lcS8d0/Vv0cs8da 


ACCESS WITH WAIT OF 1T WITH WAITIN NEGATED ACCESS WITH WAIT OF 1T WITH WAITIN 
ASSERTED PROGRAMMED AS 1T 


FIGURE 35b. WAITIN Example (WAIT is Sampled at the End of “‘T2”’) 
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FIGURE 36. Guaranteeing RAS Precharge (DTACK is Sampled at the “T2” Falling Clock Edge) 
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6.0 DP8520A/21A/22A = 4024 COLUMNS 
Video RAM Support 


The DP8520A/21A/22A provides full support for all access 
modes of video RAMs through the addition of three pins 
-(AVSRLRG, VSRL, and DT/OE) to the standard 
DP8420A/21A/22A. The access modes of video RAMs can 
be split up into two groups; video RAM transfer cycles (read 
with the serial port in active or in standby mode, write, and 
pseudo write transfer cycles), and non-transfer cycles. The — 
DP8520A/21A/22A support of video RAMs allows the full 
capabilities of the National Semiconductor Advanced 
Graphics chip set (DP8500 Series) to be realized. See Fig- 
ures 37, 38, and 58a. 
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FIGURE 37. The Video RAM (A Dual Ported Memory) 
Ideal solution for graphics frame buffer. Screen refresh 
can occur at the same time as random access to the 
frame buffer for screen update and manipulation. 
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FIGURE 38. The DP8500 Raster Graphics Processor interfaced to the DP8520A/21A/22A Video RAM Controller 
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6.0 DP8520A/21A/22A Video RAM Support (Continued) 


6.1 SUPPORT FOR VRAM TRANSFER CYCLES 
(TO THE SERIAL PORT OF THE VRAM) 


The DP8520A/21A/22A supports VRAM transfer cycles 
with the serial port in the active or standby mode. Active or 
standby refers to whether data is or is not currently being 
shifted in or out of the VRAM serial port (i.e., whether the 
shift clock (SCLK) is currently active). The DP8520A/ 
21A/22A support for data transfer cycles with the serial port 
in the active mode includes the ability to support transfer 
cycles with the serial port in the standby mode. Hereafter, 
the term VRAM transfer cycle means VRAM transfer cycle 
with the serial port in the active mode. 


In order to support VRAM transfer cycles, the DP8520A/ 
21A/22A must be able to guarantee timing with respect to 
its input CLK (which must be synchronous to VRAM shift 
clock), RAS, CAS, and DT/OE. Figure 38 shows the timing 
of a graphics memory’ system where _ the 
DP8520A/21A/22A is being used with the National Semi- 
conductor DP8500 Raster Graphics Processor (RGP). If the 
DP8520A/21A/22A is being used in a graphics frame buffer 
application, it has the ability to support a VRAM transfer 
cycle during active video time (ex. mid scan line). This is one 
of the very attractive features supported by the National 
Semiconductor Advanced Graphics chip set. Most of the 
commercial graphics controller chip sets available will only 
support VRAM transfer cycles during blanking periods 
(while the VRAM is in standby mode). 


The DP8520A/21A/22A supports VRAM transfer cycles 
during active video time by being able to guarantee an exact 
instant during which the transfer of VRAM data to the VRAM 
shift register will occur. This exact instant can be guaran- 
teed through the AVSRLRQ and VSRL inputs. 


The input AVSRLRQ disables any further internally or exter- 
nally requested refreshes or Port B access requests from 
being executed. The AVSRLRQ input does this by making 
the VRAM controller arbitration logic think that a Port A ac- 
cess is in progress from the point where the AVSRLRQ in- 
put asserts until the VRAM shift register load operation is 
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completed. Figure 38 shows the case of an externally re- 
quested refresh being disabled, because of a previous 
AVSRLRQ, until the VRAM shift register load has been 
completed. 


The VSRL input causes the DT/OE output to assert immedi- 
ately, regardiess of what else may be happening in the 
DP8520A/21A/22A. Therefore, it is the system designer’s 
responsibility to guarantee that all pending accesses have 
been completed by the time the VSRL input asserts. The 
system designer can guarantee this by issuing AVSRLRQ 
far enough in advance to guarantee that all pending access- 
es have been completed by the time VSRL asserts. 


The AVSRLRO input does not override the LOCK input (see 
Section 12.0) for dual port systems, and as a result, the 
designer must also guarantee that Port A can be accessed 
by assuring that GRANTB and LOCK are both not asserted 
when AVSRLRQ is asserted. 


Generally, the VSRL is the status of the upcoming access 
cycle (of the graphics processor). Therefore, this input pre- 
cedes the inputs ADS and AREQ that execute the VRAM 
shift register load transfer cycle. This sequence of events 
guarantees the correct relationship of DT/OE, RAS and 
CAS (DT preceding RAS and CAS when asserting and neg- 
ating). The wait logic is also intimately connected to the 
graphics functions on the DP8520A/21A/22A. The DT/OE 
(and WAIT/DTACK) relationship to VSRL during a VRAM 
transfer cycle depends upon how the DP8520A/21A/22A 
was programmed with respect to the ECASO input. If ECASO 
was negated during programming, the DT/OE output will 
follow the VSRL input. If ECASO was asserted during pro- 
gramming, the DT/OE output will follow VSRL asserting. 
DT/OE will then negate either when VSRL negates or from 
the fourth rising clock edge after VSRL asserted, whichever 
event takes place first. This allows DT to negate before 
RAS and CAS negate, thus guaranteeing the correct timing 
relationship during the transfer cycle (see Figure 38). The 
WE input of the VRAM determines whether the access is a 
read or write transfer cycle (see Figures 39 and 40 respec- 
tively). 
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6.0 DP8520A/21A/22A Video RAM Supportt (Continued) 
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(Transfer Shift Register Data into VRAM Row) 
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6.0 DP8520A/21A/22A Video RAM Support (Continued) 


During a transfer cycle (VSRL asserted during the access) 
WIN is disabled from affecting the DT/OE logic until the 
transfer cycle is completed as shown by CAS negating. Dur- 
ing a transfer cycle, the SOE (Serial Output Enable) input to 
the VRAM is asserted and is used as an output control for a 
read transfer cycle and is used as a write enable control 
during a write transfer cycle. When SOE is negated, serial 
access is disabled, and a transfer cycle cannot take place. 
SOE asserted during a read enables the serial input/output 
bus SI/O (0-3) while the VRAM data bus (DQO-3) is put 
into a high impedance state, thus allowing the transfer cycle 


to take place from the serial port. In addition to both read 
and write transfer cycles, the DP8520A/21A/22A also sup- 
ports pseudo write transfer cycles (see Figure 47). A pseu- 
do write transfer cycle must be performed after a read trans- 
fer cycle if the subsequent operation is a write transfer cy- 
cle. The DP8520A/21A/22A VRAM controller is operated 
as if it is doing a write transfer cycle, but since the SOE input 
to the VRAM is negated (disabling the serial port), a transfer 
doesn’t take place. The purpose of this pseudo write trans- 
fer cycle is to switch the SI/O (0-3) lines of the VRAM’s 
serial port from output mode to input mode. 


RAS \ / 
| 
CAS ? \ / 


1 
| 


ADDRESS ROW ADD 


| 
' 
' 
t 


WE 

(VRAM input) 
VSRL 

DT/OE 

DQ0 = 3 

SOE 


SCLK 


i VALID DATA 
FROM READ _ 
WAIT ae. 


Si/0 0-3 


COL ADD 
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FIGURE 41. Video RAM Timing Pseudo WRITE Transfer Cycle, B Port Active 
(Transfer Shift Register Data into VRAM Row) 
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6.0 DP8520A/21A/22A Video RAM Support (Continued) © 


6.2 SUPPORT FOR VRAM ACCESS CYCLES THROUGH | 
PORT A USING THE DP8520A/21A/22A | 
With the DP8520A/21A/22A, the output DT/OE will remain 
negated during write accesses (see Figure 42), but during 
read accesses it will assert after CASn asserts. DT/OE will 
be negated for a read access once CASn is negated (see 
Figure 43), causing the VRAM outputs to be enabled. If 
CASn toggles during a page mode read access, then the 
DT/OE logic will also toggle following the CASn input. 


RAS 
CAS 
WB/WE 
DT/OE 
FIGURE 42. Video RAM Random Access Write Cycle 
RAS : YY | DP 
= > eee ee ‘ a 
— _ ; = (| coun XT 
a ely 
We /WE | 
DT/OE 
bx , 


FIGURE 43. Video RAM Random Access Read Cycle 
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7.0 Additional Access Support Features 


To support the different modes of accessing, the DP8520A/ 
21A/22A have multiple access features. These features al- 
low the user to take advantage of CPU or VRAM functions. 
These additional features include: address latches and col- 
umn increment for page/burst mode support; address pipe- 
lining to allow a new access to start to a different bank of 
VRAM after CAS has been asserted and the column ad- 
dress hold time has been met; and delay CAS, to allow the 
user with a multiplexed bus to ensure valid data is present 
before CAS is asserted. 


7.1 ADDRESS LATCHES AND COLUMN INCREMENT 


The address latches can be programmed, through program- 
ming bit BO, to either latch the address or remain perma- 
nently in fall-through mode. If the address latches are used 
to latch the address, the rising edge of ALE in Mode 0 
places the latches in fall-through. Once ALE is negated, the 
address present on the row, column and bank inputs is 
latched. In Mode 1, the address latches are in fall-through 
mode until ADS is asserted. ADS asserted latches the ad- 
dress. 


Once the address is latched, the column address can be 
incremented with the input COLINC. With COLINC asserted, 
the column address is incremented. If COLINC is asserted 
with all of the bits of the column address asserted, the col- 
umn address will return to zero. COLINC can be used for 
sequential accesses of static column VRAMs. COLINC can 
also be used with the ECAS inputs to support sequential 
accesses to page mode VRAMs as shown in Figure 44. 
COLINC should only be asserted when a refresh is not in 
progress as indicated by RFIP, if programmed, being ne- 
gated during an access since this input functions as an ex- 
tend refresh when a refresh is in progress. 


The address latches function differently with the DP8522A. 
The DP8522A will latch the address of the currently granted 
port. If Port A is currently granted, the address will be 
latched as described in Section 7.1. If Port A is not granted, 
and requests an access, the address will be latched on the 
first or second positive edge of CLK after GRANTB has 
been negated depending on the programming bits RO, R1. 


1-219 


For Port B, if GRANTB is asserted, the address will be 
latched with AREQB asserted. If GRANTB is negated, the 
address will latch on the first or second positive edge of 
CLK after GRANTB is asserted depending on the program- 
ming bits RO, R1. 


7.2 ADDRESS PIPELINING 


Address pipelining is the overlapping of accesses to differ- 
ent banks of VRAM. If the majority of successive accesses 
are to a different bank, the accesses can be overlapped. 
Because of this overlapping, the cycle time of the VRAM 
accesses are greatly reduced. The DP8520A/21A/22A can 
be programmed to allow a new row address to be placed on 
the VRAM address bus after the column address hold time 
has been met. At this time, a new access can be initiated 
with ADS or ALE, depending on the access mode, while 
AREQ is used to sustain the current access. The DP8522A 
supports address pipelining for Port A only. This mode can 
not be used with page, static column or nibble modes of 
operations because the VRAM column address is switched 
back to the row address after CAS is asserted. This mode is 
programmed through address bit R8 (see Figures 45a and 
45b). 


During address pipelining in Mode 0, shown in Figure 45c, 
ALE cannot be pulsed high to start another access until 
AREQ has been asserted for the previous access for at 
least one period of CLK. DTACK, if programmed, will be 
negated once AREQ is negated. WAIT, if programmed to 
insert wait states, will be asserted once ALE and CS are 
asserted. 


In Mode 1, shown in Figure 45d, ADS can be negated once 
AREQ is asserted. After meeting the minimum negated 
pulse width for ADS, ADS can again be asserted to start a 
new access. DTACK, if programmed, will be negated once 
AREQ is negated. WAIT, if programmed, will be asserted 
once ADS is asserted. 

In either mode with either type of wait programmed, the 
DP8520A/21A/22A will still delay the access for precharge 
if sequential accesses are to the same bank or if a refresh 
takes place. 
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7.0 Additional Access Support Features (continued) 


(if programmed) 


COLINC 


0=8,9,10 


_. FIGURE 44. Column Increment 


RASO 
CASO 


ADDRESS ; COLUMN J NEXT ROW 


FIGURE 45a. Non-Address Pipelined Mode . 
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CASO 
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aooress —X ROW) 


FIGURE 45b. Address Pipelined Mode 
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TL/F/9338-81 © 


TL/F/9338-82 


Loon 


CLK \\ " i , f | 
ALE : : 
AREQ 7 , ! : 
ae od Lon ; 
: . i ; 

we = yh | 
RAS1 : Precharge 


CAST 
Q0=-8,9,10 . ROW XX COLUMN°X ROW XX COLUMN X  -ROW/ _—_———s XK COLUMN 1X ROW sO 
az : DELAYED ACCESS 
— — : —— watt | 
WAIT WAITO WAITO 2 
ACCESS #1 TO BANK O BEGINNING: ACCESS #1 TO BANK 0 ENDING : ACCESS #2 TO BANK 1 ENDING 


* ACCESS #2 TO BANK 1 BEGINNING : ACCESS #3 TO BANK 1 DELAYED BECAUSE OF PRECHARGE (2T) 
TL/F/9338-83 


FIGURE 45c. Mode 0 Address Pipelining (WAIT of 0, 12T has Been Programmed. 
WAIT is Sampled at the “T3” Falling Clock Edge) 


sainjea, woddns ssaooy jeuoippy 0'Z 


=—— pt eh 
Q0=8,9,10 _ KX ROW ) C] ROW} Sa ha 


rar a 
DTACK 15 


RFIP (RFRQ) 
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FIGURE 45d. Mode 1 Address Pipelining (DTACK 114T Programmed, DTACK is Sampled at the “T3” Falling Clock Edge) 
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7.0 Additional Access Support Features 


7.3 DELAY CAS DURING WRITE ACCESSES 

Address bit C9 asserted during programming will cause CAS 
to be delayed until the first positive edge of CLK after RAS 
is asserted when the input WIN is asserted. Delaying CAS 
during write accesses ensures that the data to be written to 


T1 


VRAM will be setup to CAS asserting as shown in Figures 
46a and 46b. |f the possibility exists that data still may not 
be present after the first positive edge of CLK, CAS can be 


delayed further with the ECAS inputs. If address bit C9 is 
negated during programming, read and write accesses will 
be treated the same (with regard to CAS). 


CLK 
ALE 
AREQ 
RAS 
CAS 
win 
MODE 0 READ ACCESS 7 MODE 0 WRITE ACCESS 
NORMAL CAS ASSERTION CAS CAN ASSERT ONLY AFTER FIRST POSITIVE — 
CLOCK EDGE AFTER RAS IS ASSERTED a 
TL/F/9338-85 
FIGURE 46a. Mode 0 Delay CAS 
11 12 3 71 2 3 
CLK A SP 
ADS 
AREQ 
RAS 
CAS aA 


wi | aaa oeaecaaieeecaecaaatainraas 


MODE 1 READ ACCESS 
NORMAL CAS ASSERTION 


a= MODE 1 WRITE ACCESS 
CAS CAN ASSERT ONLY AFTER FIRST POSITIVE 
CLOCK EDGE AFTER RAS IS ASSERTED 
TL/F/9338-86 


FIGURE 46b. Mode 1 Delay CAS 
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8.0 RAS and CAS Configuration Modes 


The DP8520A/21A/22A allow the user to configure the 
VRAM array to contain one, two or four banks of VRAM. 
Depending on the functions used, certain considerations 
must be used when determining how to set up the VRAM 
array. Programming address bits C4, C5 and C6 along with 
bank selects, BO-1, and CAS enables, ECASO-1, deter- 
mine which RAS or group of RASs and which CAS or group 
of CASs will be asserted during an access. Different memo- 
ry schemes are described. The DP8520A/21A/22A is spec- 
ified driving a heavy load of 72 VRAMs, representing four 
banks of VRAM with 16-bit words and 2 parity bits. The 
DP8520A/21A/22A can drive more than 72 VRAMs, but the 
AC timing must be increased. Since the RAS and CAS out- 
puts are configurable, all RAS and CAS outputs should be 
used for the maximum amount of drive. 


BO,B1 
RO=8,9,10 
C0=8,9,10 


ADDRESS 


DP8520A/21A/22A 


LOW BYTE 
HIGH BYTE 


ie 7 7 


8.1 BYTE WRITING 


By selecting a configuration in which all CAS outputs are 
selected during an access, each ECAS input enables a pair 
of CAS outputs to select a byte in a word size of up to 16 
bits. In this case, the RAS outputs are used to select which 
of up to 4 banks is to be used as shown in Figures 47a and 
47b. The user can also configure the VRAM array into an 8 
bank system as shown in Figure 47c. This setup can be 
used along with byte writing for an 8-bit system if the LOW 
BYTE and HIGH BYTE are connected to ECASO and 
ECAS1 respectively. The user can connect upper address 
bits to ECASO,1 for use in an 8 bank-16-bit system, but 
cannot use byte writing in this case. 


BYTEO BYTE1 


BYTEO BYTE1 


Pi =BYTEO BYTE1 


TL/F/9338-88 


FIGURE 47a. VRAM Array Setup for 16-Bit System (C6, C5, C4 = 1, 1, 0 during Programming) 
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8.0 RAS and CAS Configuration Modes (Continued) 


BO,B1 
RO=8,9, 10 
C0=8,9,10 


BYTEO BYTE1 


ADDRESS 


LOWER BYTE. 
UPPER BYTE 


DP8520A/21A/22A 


ECASO 
ECAS1 


FIGURE 47b. VRAM Array Setup for 16-Bit, 1 Bank System (C6, C5, C4 = 0, 0, 0 Allowing Erro 
or C6, C5, C4 = 0, 1, 1 No Error Scrubbing during Programming) 


BO,B1 
RO=8,9,10 
C0=8,9,10 


ADDRESS 


DP8520A/21A/22A 


UPPER ADDRESS BIT— 
UPPER ADDRESS BIT 


FIGURE 47c. 8 Bank VRAM Array (C6, C5, C4 = 1, 1,0 during Programming) 
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8.0 RAS and CAS Configuration Modes (Continued) 


8.2 MEMORY INTERLEAVING 


Memory interleaving allows the cycle time of VRAMs to be 
reduced by having sequential accesses to different memory 
banks. Since the DP8520A/21A/22A have separate pre- 
charge counters per bank, sequential accesses will not be 
delayed if the accessed banks use different RAS outputs. 
To ensure different RAS outputs will be used, a mode is 
selected where either one or two RAS outputs will be as- 
serted during an access. The bank select or selects, BO and 
B1, are then tied to the least significant address bits, caus- 
ing a different group of RASs to assert during each sequen- 
tial access as shown in Figure 48. |n this figure there should 
be at least one clock period of all RAS’s negated between 
different RAS’s being asserted to avoid the condition of a 
CAS before RAS refresh cycle. 


B 
RO=8,9,10 
CO=8,9,10 


ADDRESS 


LOWER BYTE 
UPPER BYTE 


DP8520A/21A/22A 


8.3 ADDRESS PIPELINING 


Address pipelining allows several access RASs to be as- 
serted at once. Because RASs can overlap, each bank re- 
quires either a mode where one RAS and one CAS are used 
per bank as shown in Figure 49a or where two RASs and 
two CASs are used per bank as shown in Figure 49b. In 
order to perform byte writing while using address pipelining, 
external gating on the CAS outputs must be used. If the 
array is not layed out this way, a CAS to a bank can be low 
before RAS, which will cause a refresh of the VRAM, not an 
access. To take full advantage of address pipelining, memo- 
ry interleaving is used. TO memory interleave, the least sig- 
nificant address bits should be tied to the bank select inputs 
to ensure that all “back to back” sequential accesses are 
not delayed, since different memory banks are accessed. 


TL/F/9338-90 


FIGURE 48. Memory Interleaving with Byte Writing Capability (C6, C5, C4 = 1, 1, 0 during Programming) 
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8.0 RAS and CAS Configuration Modes (continued) 


LOW ORDER ADR BIT 
Bi DP8520A/21A/22A 


LOW ORDER ADR #14 °° 
ADDRESS —« R0~8,9, 10 CASI 


C0=8,9,10 

may be tied 
to ground in 
this case. 


ECASO 
ECAS1 


‘enim 
—_ 


| _ . TL/F/9338-91 
FIGURE 49a. VRAM Array Setup for 4 Banks Using Address Pipelining (C6, C5, C4 = 1, 1, 1 
or C6, C5, C4 = 0, 1, 0 (Also Allowing Error Scrubbing) during Programming) 


DP8520A/21A/22A 


RO-8,9,10 
C0-8,9,10 


; TL/F/9338-92 
FIGURE 49b. VRAM Array Setup for Address Pipelining with 2 Banks (C6, C5, C4 = 1,0, 1 
or C6, C5, C4 = 0, 0, 1 (Also Allowing Error Scrubbing) during Programming) 


8.4 ERROR SCRUBBING — | : | | 
In error scrubbing during refresh, the user selects one, two as words. Since the CAS signals are not used to select 
or four RAS and CAS outputs per bank. When performing individual bytes, their corresponding ECAS inputs can be 


error detection and correction, memory is always accessed tied low as shown in Figures 50a and 50b. 


LOW ORDER ADR BIT 


LOW ORDER ADR BIT + 1 


ADDRESS 


DP8520A/21A/22A 


RO=8,9,10 
C0=8,9,10 


MAY BE TIED TO 
GROUND IN THIS 
CASE 


TL/F/9338-93 
FIGURE 50a. VRAM Array Setup for 4 Banks Using Error Scrubbing (C6, C5, C4 = 0, 1, 0 during Programming) 


DP8520A/21A/22A 


RO=8,9,10 
C0=8,9,10 


RAS2 
RASS 
CAS2 


ECASO 
ECAS1 


MAY BE TIED TO 
GROUND IN THIS 
CASE 


TL/F/9338-94 
FIGURE 50b. VRAM Array Setup for Error Scrubbing with 2 Banks (C6, C5, C4 = 0, 0, 1 during Programming) 
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8.0 RAS and CAS Configuration Modes (continuea) 


8.5 PAGE/BURST MODE 


In a static column, page or burst mode system, the least 
significant bits must be tied to the column address in order 
to ensure that the page/burst accesses are to sequential 
memory addresses, as shown in Figure 57. In a nibble mode 
system, the two least significant address bits (A2, A3) must 
be tied to the highest row and column address inputs (de- 
pends on VRAM size) to ensure that the toggling bits of 
nibble mode VRAMs are to sequential memory addresses. 


ADDRESS 
BITS AFTER 


LOW ORDER { 
COLUMN * 


ADDRESS AFTER BANK 


: 


2 


LOW ORDER BITS 


ECASO 


WRITE SECTION, J ____ 
ECAS1 


SEE BYTE 
FOR CONNECTION 


RO=8,9,10 


C0-8,9,10 


DP8520A/21A/22A 


The ECAS inputs may then be toggled with the DP8520A/ 
21A/22A’s address latches in fall-through mode, while 
AREQ is asserted. The ECAS inputs can also be used to 
select individual bytes. When using nibble mode VRAMS, 
the third and fourth address bits can be tied to the bank 
select inputs to perform memory interleaving. In page or 
static column modes, the two address bits after the page 
size can be tied to the bank select inputs to select a new 
bank if the page size is exceeded. 


SEE BYTE 
WRITE SECTION 
FOR CONNECTION 


SEE BYTE 
WRITE SECTION 
FOR CONNECTION 


TL/F/9338-95 


*See table below for row, column & bank address bit map. A0,A1 are used for byte addressing in this example. 


Page Mode/Static Column Mode Page Size 


Addresses | Nibble Mode* 
256 Bits/Page | 512 Bits/Page | 1024 Bits/Page | 2048 Bits/Page 


Column | R9,C9=A a Cco- ora Co- wean 10 | Co- aie 11 2 


*Assuming 1 M-bit Vrams are being used. 


Assume that the least significant address bits are used for byte addressing. Given a 32-bit system A0,A1 would be 


used for byte addressing. 
X = DON’T CARE, the user can do as he pleases. 


FIGURE 51. Page, Static Column, Nibble Mode System 
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9.0 Programming and Resetting 
The DP8520A/21A/22A must be programmed by one of 


two possible programming sequences before it can be used.. 


At power up, the DP8520A/21A/22A programming bits are 
in an undefined state. All internal latches and flip-flops are 


cleared. After programming, the DP8520A/21A/22A enters. 


a 60 ms initialization period. During this initialization period, 
the DP8520A/21A/22A performs refreshes about every 
15 ws; this makes further VRAM warmup cycles unneces- 
sary. The chip can be programmed as many times as the 
user wishes. After the first programming, the 60 ms initializa- 
tion period will not be entered into unless the chip is reset. 
Refreshes occur during the 60 ms initialization period. If 


ECASO was asserted during programming, the RFIP 


(RFRQ) pin will act as RFIP and will be asserted throughout 
the initialization period, otherwise the pin will act like RFRQ 
and toggle every 13 us-15 ps in conjunction with internal 
refresh requests. If the user attempts an access during the 
initialization period, wait states will be inserted into the ac- 
cess cycle until the initialization period is complete and RAS 
precharge time has been met. The actual initialization time 
period is given by the following formula: 
T = 4096* (Clock Divisor Select) 
*(Refresh Clock Fine Tune) 
/(DELCK Frequency) 


9.1 MODE LOAD ONLY PROGRAMMING 


MODE LOAD, ML, asserted enables an internal 23-bit pro- 
grammable register. To use this method, the user asserts 
ML, enabling the internal programming register. After ML is 
asserted, a valid programming selection is placed on the 
address bus (and ECAS0), then ML is negated: When ML is 
negated, the value on the address bus (and ECASO) is 
latched into the internal programming register and the 
DP8520A/21A/22A is programmed, as shown in Figure 
52a. After ML is negated, the DP8520A/21A/22A will enter 
the 60 ms initialization period only if this is the first program- 
ming after power up or reset. 


Using this method, a set of transceivers on the address bus 
can be put at TRI-STATE® by the system reset signal. A 
combination of pull-up and pull-down resistors can be used 
on the address inputs of the DP8520A/21A/22A to select 
the programming values, as shown in Figure 52b. 


9.2 CHIP SELECTED ACCESS PROGRAMMING 


The chip can also be programmed by asserting ML and per- 
forming a chip selected access. ADS (or ALE) is disabled 
internally until after programming. To program the chip using 
this method, ML is asserted. After ML is asserted, CS is 
asserted and a valid programming selection is placed on the 
address bus. When AREQ is asserted, the chip is pro- 
grammed with the programming selection on the address 
bus. After AREQ is negated, ML can be negated as shown 
in Figure 53a. 


SYSTEM POWER UP RESET 


*Pull-Up or Pull-Down Resistors on Each Address Input 


PROGRAMMING 
BITS VALID. 


FIGURE 52a. Mode Load Only Programming 


ADDRESS 
LATCHES 
AND 
DRIVERS 


ENABLE 


TL/F/9338-96 


-}ppas20/21/22 


MODE:.LOAD 


TL/F/9338-~97 


FIGURE 52b. Programming during System Reset 
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9.0 Programming and Resetting (continued) 


ML 


ADS OR KXXXXXXXXAAKAAKAKKKXAKAA AKA AAA AKA 


~ VWVVVVVVY 
cs XXX XX XX\ 


AREQ 


PRYKEKXIXK KX IKIKIKAX 


D FASO VALID PROGRAMMING SELECTION 
AND ECASO (| VALID PROGRAMMING SELECTION 


DP8520A/21A/22A 
PROGRAMMED 


TL/F/9338-98 


FIGURE 53a. CS Access Programming 


Using this method, various programming schemes can be 
used. For example if extra upper address bits are available, 
an unused high order address bit can be tied to the signal 
ML. Using this method, one need only write to a page of 
memory, thus asserting the high order bit and in turn pro- 
gramming the chip as shown in Figure 53b. 


RO=-9, CO=-9, BO, Bi 


and ECASO DP8520A/21A/22A 


ADDRESS 


TL/F/9338-99 
FIGURE 53b. Programming the DP8520A/21A/22A 
through the Address Bus Only 


An I/O port can also be used to assert ML. After ML is 
asserted, a chip selected access can be performed to pro- 
gram the chip. After the chip selected access, ML can be 
negated through the !/O port as shown in Figure 53c. 


SYSTEM RESET: RST 


DATA 


and ECASO 
ADDRESS . 


DP8520A/21A/22A 


TL/F/9338-A0 
FIGURE 53c. Programming the DP8520A/21A/22A 
through the Address Bus and an I/O Port 


Another simple way the chip can be programmed is the first 
write after system reset. This method requires only a flip- 
flop and an OR gate as shown in Figure 53d. At reset, the 
flip-flop is preset, which pulls the Q output low. Since WR is 
negated, ML is not enabled. The first write access is used to 
program the chip. When WR is asserted, ML is asserted. 
WR negated clocks the flip-flop, negates ML, and programs 
the DP8520A/21A/22A with the address and ECASO avail- 
able at that time. CS does not need to be asserted using 
this method. 


DP8520A/21A/22A 


sooness SXXRRN RXXRXKKKRKENE 


FIGURE 53d. Programming the DP8520A/21A/22A on the First CPU Write after Power Up 
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9.0 Programming and Resetting (Continued) nS ee eee 

9.3 EXTERNAL RESET Figure 54a, ML negated will program the chip. If ML is nega- 
ted before or at the same time as DISRFSH as shown in 
Figure 54b, the chip will not be programmed. After the chip 
is programmed, the 60 ms initialization period will be en- 

tered into if this is the first programming after power up or 
reset. 


At power up, all internal latches and flip-flops are cleared. 
The power up state can again be entered by asserting ML 
and DISRFSH for 16 positive edges of CLK. After resetting if 
the user negates DISRFSH before negating ML as shown in 


SS 16 CLOCKS aaa 


' 
DISRFSH \ Y 


FIGURE 54a. Chip Reset and Programmed 


TL/F/9338-A2 


|+————_ 16 POSITIVE EDGES OF CLK ————>| 


DISRFSH 


. TL/F/9338-A3 
FIGURE 54b. Chip Reset but Not Programmed 
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9.0 Programming and Resetting (Continued) 
9.4 PROGRAMMING BIT DEFINITIONS 


C6, C5, C4 
0,0,0 


0, 0, 1 


0,1,0 


0,1, 1 


Description 
Extend CAS/Refresh Request Select 


The CASn outputs will be negated with the RASn outputs when AREQ (or AREQB, DP8522A only) is 
negated. The RFIP pin will function as refresh in progress. During a video shift register load operation, the 
DT/OE output will be negated by either the 4th rising clock edge after the input VSRL asserts, or by the 
VSRL input negating, whichever occurs first, when this mode is programmed. 


The CASn outputs will be negated, during an access (Port A (or Port B, DP8522A only)) when their 
corresponding ECASn inputs are negated. This feature allows the CAS outputs to be extended beyond the 
RAS outputs negating. Scrubbing refreshes are NOT affected. During scrubbing refreshes the CAS outputs 
will negate along with the RAS outputs regardless of the state of the ECAS inputs. 

The RFIP output will function as ReFresh ReQuest (RFRQ) when this mode is programmed. The DT/OE 
output will be negated by the input VSRL negating when this mode is programmed. 


Access Mode Select 
ACCESS MODE 0: ALE pulsing high sets an internal latch. On the next positive edge of CLK, the access 


- (RAS) will start. AREQ will terminate the access. 


ACCESS MODE 1: ADS asserted starts the access (RAS) immediately. AREQ will terminate the access. 
Address Latch Mode 
ADS or ALE asserted for Port A or AREGB asserted for Port B with the appropriate GRANT latch the input 


row, column and bank address. 
The row, column and bank latches are fall through. 


Delay CAS during WRITE Accesses 


CAS is treated the same for both READ and WRITE accesses. 


During WRITE accesses, CAS will be asserted by the event that occurs last: CAS asserted by the internal 
delay line or CAS asserted on the positive edge of CLK after RAS is asserted. 


Row Address Hold Time 


Row Address Hold Time = 25 ns minimum 
Row Address Hold Time = 15 ns minimum 


Column Address Setup Time 


Column Address Setup Time = 10 ns minimum 
Column Address Setup Time = 0 ns minimum 


RAS and CAS Configuration Modes/Error Scrubbing during Refresh 


RASO-3 and CASO-3 are all selected during an access. For a particular CAS to be asserted, its 
corresponding ECAS input must be asserted. BO and B1 are not used during an access. Error scrubbing 
during refresh. 


RAS and CAS pairs are selected during an access by B1. For a particular CAS to be asserted, its 
corresponding ECAS input must be asserted. 

B1 = O during an access selects RASO-1 and CASO-1. 

B1 = 1 during an access selects RAS2-3 and CAS2-3. 

BO is not used during an Access. 

Error scrubbing during refresh. 


_ RAS and CAS singles are selected during an access by BO-1. For a particular CAS to be asserted, its 


corresponding ECAS input must be asserted. 

B1 = 0, BO = 0 during an access selects RASO and CASO. 
B1 = 0, BO = 1 during an access selects RAS1 and CAS1. 
Bi = 1, BO = 0 during an access selects RAS2 and CAS2. 
Bi = 1, BO = 1 during an access selects RAS3 and CASS. 
Error scrubbing during refresh. 


RASO-3 and CASO-3 are all selected during an access. For a particular CAS to be asserted, its 
corresponding ECAS input must be asserted. 

Bi, BO are not used during an access. 

No error scrubbing. (RAS only refreshing) 
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9.0 Programming and Resetting (Continued) 
9.4 PROGRAMMING BIT DEFINITIONS (Continued) 


Symbol 
C6, C5, C4 
1,0, 0 


1,0, 1 


1, 1,0 


5 Pe a 


C3 


C2,C1,C0 
0,0, 0 


Description 
RAS and CAS Configuration Modes (Continued) — 


RAS pairs are selected by B1. CASO-3 are all selected. For a particular CAS to be asserted, its 
corresponding ECAS input must be asserted. 

B1 = 0 during an access selects RASO-1 and CASO-3. 

B1 = 1 during an access selects RAS2- 3 and CASO-3. 

BO is not used during an access. 

No error scrubbing. 


RAS and CAS pairs are selected by B1. For a particular CAS to be asserted its corresponding ECAS input 
must be asserted. 


B1 = 0 during an access selects RASO-1 and CASO-1. 
B1 = 1 during an access selects RAS2-3 and CAS2-3. 
BO is not used during an access. 

No error scrubbing. 


RAS singles are selected by BO-—1. CASO-3 are all selected. For a particular CAS to be asserted, its 


corresponding ECAS input must be asserted. 
B1 = 0, BO = 0 during an access selects RASO and CASO-3. 
Bi = 0, BO = 1 during an access selects RAS1 and CASO-3. 


-B1 = 1, BO = 0 during an access selects RAS2 and CASO-3. 


Bi = 1,BO = 1 during an access selects RAS3 and CASO-3. 
No error scrubbing. 


RAS and CAS singles are selected by BO, 1. For a particular CAS to be asserted, its corresponding ECAS 


input must be asserted. 
B1 = 0, BO = 0 during an access selects RASO and CASO. 
B1 = 0, BO = 1 during an access selects RAS1 and CAS1. 


B1=1,B0=0 during an access selects RAS2 and CAS2. 


B1 = 1,B0 = 1 during an access selects RAS3 and CAS3. 
No error scrubbing. 


Refresh Clock Fine Tune Divisor 


Divide delay line/refresh clock further by 30 (If DELCLK/Refresh Clock Clock Divisor = 2 MHz = 15 us 
refresh period). 


Divide delay line/refresh clock further by 26 (If DELCLK/Refresh Clock Clock Divisor = 2 MHz = 13 ps 
refresh period). 


Delay Line/Refresh Clock Divisor Select 


Divide DELCLK by 10 to get as close to 2 MHz as possible. 
Divide DELCLK by 9 to get as close to 2 MHz as possible. 
Divide DELCLK by 8 to get as close to 2 MHz as possible. 
Divide DELCLK by 7 to get as close to 2 MHz as possible. 
Divide DELCLK by 6 to get as close to 2 MHz as possible. 
Divide DELCLK by 5 to get as close to 2 MHz as possible. 
Divide DELCLK by 4 to get as close to 2 MHz as possible. 
Divide DELCLK by 3 to get as close to 2 MHz as possible. 


_ Refresh Mode Select 


RASO-3 will all assert and negate at the same time during a refresh. 


Staggered Refresh. RAS outputs during refresh are separated by one positive clock edge. Depending on the 
configuration mode chosen, either one or two RASs will be asserted. 


Address Pipelining Select 


Address pipelining is selected. The VRAM controller will switch the VRAM column address back to the row 
address after guaranteeing the column address hold time. 


Non-address pipelining is selected. The VRAM controller will hold the column address on the VRAM address 
bus until the access RASs are negated. 
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9.0 Programming and Resetting (Continued) 
9.4 PROGRAMMING BIT DEFINITIONS (Continued) 


Symbol 
R7 


0 
1 


R6 


1,0 


1,1 


R3, R2 
0, 0 


0, 1 


1,0 


1,1 


R1,RO_ 


0,0 


0, 1 


1,0 


1,1 


Description 


_ WAIT or DTACK Select 


WAIT type output is selected. 
DTACK (Data Transfer ACKnowledge) type output is selected. 


Add Wait States to the Current Access if WAITIN is Low 


WAIT or DTACK will be delayed by one additional positive edge of CLK. 
WAIT or DTACK will be delayed by two additional positive edges of CLK. 


WAIT/DTACK during Burst (See Section 5.1.2 or 5.2.2) 


NO WAIT STATES; If R7 = 0 during programming, WAIT will remain negated during burst orion of access. 
If R7 = 1 programming, DTACK will remain asserted during burst portion of access. 


1T; If R7 = 0 during programming, WAIT will assert when the ECAS inputs are negated with AREQ asserted. 
WAIT will negate from the positive edge of CLK after the ECASs have been asserted. 

lf R7 = 1 during programming, DTACK will negate when the ECAS inputs are negated with AREQ asserted. 
DTACK will assert from the positive edge of CLK after the ECASs have been asserted. 


YoT; If R7 = 0 during programming, WAIT will assert when the ECAS inputs are negated with AREQ 
asserted. WAIT will negate on the negative level of CLK after the ECASs have been asserted. 

If R7 = 1 during programming, DTACK will negate when the ECAS inputs are negated with AREQ asserted. 
DTACK will assert from the negative level of CLK after the ECASs have been asserted. 


OT; If R7 = O during programming, WAIT will assert when the ECAS inputs are negated. WAIT will negate 
when the ECAS inputs are asserted. 
lf R7 = 1 during programming, DTACK will negate when the ECAS inputs are negated. DTACK will assert 
when the ECAS inputs are asserted. 


WAIT/DTACK Delay Times (See Section 5.1.1 or 5.2.1) 


NO WAIT STATES; If R7 = 0 during programming, WAIT will remain high during non-delayed accesses. 
WAIT will negate when RAS is negated during delayed accesses. 
NO WAIT STATES; If R7 = 1 during programming, DTACK will be asserted when RAS is asserted. 


WT; If R7 = O during programming, WAIT will negate on the negative level of CLK, after the access RAS. 
1T; If R7 = 1 during programming, DTACK will be asserted on the positive edge of CLK after the access 
RAS. 


NO WAIT STATES, 14T; If R7 = 0 during programming, WAIT will remain high during non-delayed accesses. 
WAIT will negate on the negative level of CLK, after the access RAS, during delayed accesses. 

1%T; lf R7 = 1 during programming, DTACK will be asserted on the negative level of CLK after the access 
RAS. 


1T; If R7 = 0 during programming, WAIT will negate on the positive edge of CLK after the access RAS. 
114T; lf R7 = 1 during programming, DTACK will be asserted on the negative level of CLK after the positive 
edge of CLK after the access RAS. 


RAS Low and RAS Precharge Time 


RAS asserted during refresh = 2 positive edges of CLK. 
AS precharge time = 1 positive edge of CLK. 
RAS will start from the first positive edge of CLK after GRANTB transitions (DP8522A). 


RAS asserted during refresh = 3 positive edges of CLK. 
RAS precharge time = 2 positive edges of CLK. 


aed 


RAS will start from the second positive edge of CLK after GRANTB transitions (DP8522A). 


RAS asserted during refresh = 2 positive edges of CLK. 
RAS precharge time = 2 positive edges of CLK. 
RAS will start from the first positive edge of CLK after GRANTB transitions (DP8522A). 


- RAS asserted during refresh = 4 positive edges of CLK. 


RAS precharge time = 3 positive edges of CLK. 
RAS will start from the second positive edge of CLK after GRANTB transitions (DP8522A). 


Note 1: The configuration modes allow RASs and CASs to be grouped such that each RAS and CAS will drive one-fourth of the total VRAM array whether the array 
is organized as 1, 2, or 4 banks. 

Note 2: !n order for a CAS output to go low during an access, it must be both selected and enabled. ECASO-1 are used to enable the CAS outputs (ECASO enables 
CASO,1; ECAS1 enables CAS2,3). Selection is determined by the configuration mode and B1, BO. 
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10.0 Test Mode 


Staggered refresh in combination with the error scrubbing 
mode places the DP8520A/21A/22A in test mode. In this 
mode, the 24-bit refresh counter is divided into a 13-bit and 
11-bit counter. During refreshes both counters are incre- 
mented to reduce test time. 


11.0 VRAM Critical Timing 
Parameters 


The two critical timing parameters, shown in Figure 55, that 
must be met when controlling the access timing to a VRAM 
are the row address hold time, tRAH, and the column ad- 
dress setup time, tASC. Since the DP8520A/21A/22A con- 
tain a precise internal delay line, the values of these param- 
eters can be selected at programming time. These values 
will also increase and decrease if DELCLK varies from 
2 MHz. 


11.1 PROGRAMMABLE VALUES OF tRAH AND tASC 


The DP8520A/21A/22A allow the values of tRAH and tASC 
to be selected at programming time. For each parameter, 
two choices can be selected. tRAH, the row address hold 
time, is measured from RAS asserted to the row address 
starting to change to the column address. The two choices 
for tRAH are 15 ns and 25 ns, programmable through ad- 
dress bit C8. 


tASC, the column address setup time, is measured from the 
column address valid to CAS asserted. The two choices for 
tASC are 0 ns and 10 ns, programmable through address bit 
C7. 


RAS 
CAS 


QOUT 


11.2 CALCULATION OF tRAH AND tASC. 


There are two clock inputs to the DP8520A/21A/22A. 
These two clocks, DELCLK and CLK can either be tied to- 
gether to the same clock or be tied to different clocks run- 
ning asynchronously at different frequencies. 


The clock input, DELCLK, controls the internal delay line 
and refresh request clock. DELCLK should be a multiple of 
2 MHz. If DELCLK is not a multiple of 2 MHz, tRAH and 
tASC will change. The new values of tRAH and tASC can be 
calculated by the following formulas: 


If tRAH was programmed to equal 15 ns then tRAH = 
30*(((DELCLK Divisor)* 2 MHz/(DELCLK Frequency))— 1) 
+ 15 ns. . 


If tRAH was programmed to equal 25 ns then tRAH = 
30*(((DELCLK Divisor)* 2 MHz/(DELCLK Frequency))— 1) 
+ 25 ns. 


lf tASC was programmed to equal 0 ns then tASC = 15* 
((DELCLK Divisor)* 2 MHz/(DELCLK Frequency)) — 15 ns. 


lf tASC was programmed to equal 10 ns then tASC = 25* 
((DELCLK Divisor)* 2 MHz/(DELCLK Frequency)) — 15 ns. 


Since the values of tRAH and tASC are increased or de- 
creased, the time to CAS asserted will also increase or de- 
crease. These parameters can be adjusted by the following 
formula: 


Delay to CAS = Actual Spec. + Actual tRAH — 
Programmed tRAH + Actual tASC — Programmed tASC. 


TL/F/9338—-A4 


FIGURE 55. tRAH and tASC 


1-234 


12.0 Dual Accessing Functions (DP8522A) 


The DP8522A has all the functions previously described. In 
addition to those features, the DP8522A also has the capa- 
bilities to arbitrate among refresh, Port A and a second port, 
Port B. This allows two CPUs to access a common VRAM 
array. VRAM refresh has the highest priority followed by the 
currently granted port. The ungranted port has the lowest 
priority. The last granted port will continue to stay granted 
even after the access has terminated, until an access re- 
quest is received from the ungranted port (see Figure 56a ). 
The dual access configuration assumes that both Port A 
and Port B are synchronous to the system clock. If they are 
not synchronous to the system clock they should be exter- 
nally synchronized (Ex. By running the access requests 
through several Flip-Flops, see Figure 58a). 


12.1 PORT B ACCESS MODES (DP8522A) 


Port B accesses are initiated from a single input, AREQB. 
When AREQB is asserted, an access request is generated. 
lf GRANTB is asserted and a refresh is not taking place or 
precharge time is not required, RAS will be asserted when 
AREQB is asserted. Once AREQB is asserted, it must stay 
asserted until the access is over. AREQB negated, negates 
RAS as shown in Figure 56b. Note that if ECASO = 1 during 
programming the CAS outputs may be held asserted (be- 
yond RASn negating) by continuing to assert the appropri- 
ate ECAS input (the same as Port A accesses). If Port B is 
not granted, the access will begin on the first or second 
positive edge of CLK after GRANTB is asserted (See RO, 
R1 programming bit definitions) as shown in Figure 56c, as- 


CLK 


AREQB 


RAS 


CAS 


suming that Port A is not accessing the VRAM (CS, ADS/ 
ALE and AREQ) and RAS precharge for the particular bank 
has completed. It is important to note that for GRANTB to 
transition to Port B, Port A must not be requesting an ac- 
cess at a rising clock edge (or locked) and Port B must be 
requesting an access at that rising clock edge. Port A can 
request an access through CS and ADS/ALE or CS and 
AREQ. Therefore during an interleaved access where CS 
and ADS/ALE become asserted before AREQ from the pre- 
vious access is negated, Port A will retain GRANTB = 0 
whether AREQB is asserted or not. 

Since there is no chip select for Port B, AREQB must incor- 


porate this signal. This mode of accessing is similar to Mode 
1 accessing for Port A. 


AREQA*AREQB*LOCK 


PORT A 


AREQA*AREQB*LOCK 


PORT B 


TL/F/9338—A5 
Explanation of Terms 


AREQA = Chip Selected access request from Port A 
AREQB = Chip Selected access request from Port B 
LOCK = Externally controlled LOCKing of the Port 


that is currently GRANTed. 

FIGURE 56a. DP8522A PORT A/PORT B ARBITRATION 
STATE DIAGRAM. This arbitration may take place 
during the “ACCESS” or “REFRESH” 
state (see Figure 7a). 


3 | 7 | 


Cf 


S 


TL/F/9338-A6 


FIGURE 56b. Access Request for Port B 


CLK 


AREQB 


GRANTB 


RAS 


CAS 


=e eee: 
a: 
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FIGURE 56c. Delayed Port B Access 
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12.0 Dual Accessing Functions (DP8522A) (Continued) 


12.2 PORT B WAIT STATE SUPPORT (DP8522A) — 


Advanced transfer acknowledge for Port B, ATACKB, is 
used for wait state tate support for Port B. This output will be 
asserted when RAS for the Port B access is asserted, as 
shown in Figures 57a and 57b. Once asserted, this output 
will stay asserted until AREQB is negated. With external 
logic, ATACKB can be made to interface to any CPU’s wait 
input as shown in Figure 57c. 


ATACKB ATACKB TO CPU B 


CLK 


TL/F/9338-A8 


A) Extend ATACK to 14T (1% Clock) after RAS goes low. 


ATACKB ATACKB TO CPU B 


CLK 


TL/F/9338-—A9 
B) Extend ATACK to 1T after RAS goes low. 


CLK © 


AREQB XY 


ATACKB 


ATACKB 
| TO CPU B 


CPU B CLOCK 
TL/F/9838-B0 


C) Synchronize ATACKB B to CPU B Clock. This is useful if CPU B runs asyn- 
chronous. to the DP8522. . 


FIGURE 57c. Modifying Wait Logic for Port B 


12.3 COMMON PORT A AND PORT B DUAL PORT 
FUNCTIONS 


An input, LOCK, and an output, GRANTB, add additional 
functionality to the dual port arbitration logic. LOCK allows 
Port A or Port B to lock out the other port from the VRAM. 
When a Port is locked out of the VRAM, wait states will be 
inserted into its access cycle until it is allowed to access 
memory. GRANTB is used to multiplex the input control sig- 
nals and addresses to the DP8522A. 


12.3.1 GRANTB Output 

The output GRANTB determines which port has current ac- 
cess to the VRAM array. GRANTB asserted signifies Port B 
has access. GRANTB negated signifies Port A has access 
to the VRAM array. : 


SH 


FIGURE 57a. Non-Delayed Port B Access 


CLK A r 


AREQB 


GRANTB 


ATACKB 4 


RAS 


TL/F/9338-B1 


TL/F/9338-B2 


FIGURE 57b. Delayed Port B Access 


12.0 Dual Accessing Functions (DP8522A) (continued) 


Since the DP8522A has only one set of address inputs, the the VRAM array, the GRANTB output will transition from a 
signal is used, with the addition of buffers, to allow the cur- rising clock edge from AREQ or AREQB negating and will 
rently granted port’s addresses to reach the DP8522A. The preceed the RAS for the access by one or two clock peri- 
signals which need to be bufferred are RO-10, CO-10, ods. GRANTB will then stay in this state until the other port 
BO-1, ECASO-—1, and LOCK. All other inputs are not com- requests an access and the currently granted port is not 
mon and do not have to be buffered as shown in Figure 58a. accessing the VRAM as shown in Figure 58b. 


lf a Port, which is not currently granted, tries to access 


Dual Accessing Using the DP8522A 


PORT A PORT B 
(SYNCHRONOUS) (SYNCHRONOUS OR 


ASYNCHRONOUS) 
CLOCK 
CLOCK 
CLK, DLV CLK 


agg ( CONTROL] Request ano "conor 
i Nar ———__,| Sasa [War 


LOGIC LOGIC * 


PAL — = 
om WE, LOCK ENB C = 
WE, LOCK 


DP8522A WE. LOCK CPU 
ADDRESS, WE, LOCK TRI= STATE col 
; BUFFERS t ADDRESS | 
Uieielalls ease ) 


GRAPHICS fg WAIT 


PROCESSOR 


ADDRESS 


BUFFERS [ADDRESS 


PLANE Te TO COLOR LOOKUP TABLE, 
——— : CONVERTERS AND CRT 


Sire CiGCe rR VEO DRA N= PLANES (BANKS) of VIDEO RAMS 
PIXEL CLOCK FOR VIDEO SHIFT REGISTER (May also contain conventional DRAMS) 


DIR, CS, DBE ENB DIR, CS, DBE 
e 


*If Port B is synchronous the Request Synchronizing logic will not be required. 


FIGURE 58a. Dual Accessing Using the DP8522A to interface a CPU and a Graphics Processor to a Video RAM System 


TL/F/9338—-1 
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12.0 Dual Accessing Functions (DP8522A) (Continued) 


CLK 


ADS 


GRANTB 


R/W 


PORT B DATA 


ECASO = 3 
CASO = 3 


ADDRESS IN 


PORT A 


ATACKB 


FIGURE 58b. Wait States during a Port B Access 


42.3.2 LOCK Input | 


When the LOCK input is asserted, the currently granted port 
can “lock out” the other port through the insertion of wait 
States to that port’s access cycle. LOCK does not disable 
refreshes, it only keeps GRANTB in the same state even if 
the other port requests an access, as shown in Figure 59. 


a J. 
Saeee 


LOCK can be used by either port. The user must guarantee | 


that LOCK is not asserted with Port B granted when 
AVSRLRO is asserted to request a VRAM transfer cycle. 


GRANTB 


AREQ Ff 


= 


PORT A 


PORT B PORT A 


TL/F/9338-B3 


12.4 DUAL PORTING AND VRAM TRANSFER CYCLES 


The input AVSRLRQ disables any further internally or exter- 
nally requested refreshes or Port B access requests from 
being executed. The AVSRLRQ input does this by making 
the VRAM controller arbitration logic think that a Port A ac- 
cess is in progress from the point where the AVSRLRQ in- 
put goes low until the VRAM shift register load operation is 
completed. 


ATACKB 


TL/F/9338-B4 


FIGURE 59. LOCK Function 
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13.0 Absolute Maximum Ratings (note 1) 


lf Military/Aerospace specified devices are required, 
please contact the National Semiconductor Sales 
Office/Distributors for availability and specifications. 


Temperature under Bias ................... 0°C to + 70°C 
Storage Temperature ................. —65°C to + 150°C 


All Input or Output Voltage 
with Respect to GND................... —0.5V to +7V 


Power Dissipation @ 20 MHZ.................02eeee 0.5W 
ESD Rating to be determined. 


14.0 DC Electrical Characteristics 1, = 0°c to +70°C, Voc = 5V +10%, GND = OV 


Symbol Conditions | Min | Typ | Max | Units 
Logical 1 Input Voltage Tested with a Limited 
2.0 
Functional Pattern 
Logical 0 Input Voltage Tested with a Limited 
Functional Pattern 


Q and WE Outputs lon = —10mA pM OO) 


VIH 

VIL 

VOH1 

Vou4 Q and WE Outputs lo. = 10mA 
VoH2 All Outputs except Qs, WE | Ioy = —3 mA 


< 


Vous an ae 

iN -1o | | 10 | HA 

Imm. | MLInputCurrent(Low) | Vn=GND | S| S| tsp 

Icc1__| Standby Current | CLK at@MHz(Vin=Vocor@ND) | —Sss|| 6 =| 15 —|_smA 

Icc1__| StandbyCurrent | CLKat20MHz(Vin=VocorGnd) | ss | 8 | 17__—«|_smA 

SS CLK at 25 MHz (Vin = Voc or GND) | 10 | 20 =| mA 

lees CLK at 8 MHz (Inputs Active) po] 0 | a 
(ILoAD = 9) (Vin = Voc or GND) 

is CLK at 20 MHz (Inputs Active) po fe i 
(ILoAD = 9) (Vin = Voc or GND) 

loce Supply Current CLK at 25 MHz (Inputs Active) ps0 | os ak 
(ILoaD = 9) (Vin = Vcc or GND) 

Civ" | InputCapacitance | fivatiMHz | TT 


*Note: Ci, is not 100% tested. 


15.0 AC Timing Parameters: DP8520A/DP8521A/DP8522A 


Two speed selections are given, the DP8520A/21A/22A-20 
and the DP8520A/21A/22A-25. The differences between 
the two parts are the maximum operating frequencies of the 
input CLKs and the maximum delay specifications. Low fre- 
quency applications may use the ‘‘—25” part to gain im- 
proved timing. 

The AC timing parameters are grouped into sectional num- 
bers as shown below. These numbers also refer to the tim- 
ing diagrams. 

1-38 Common parameters to all modes of operation 
50-56 Difference parameters used to calculate; 

RAS low time, 

RAS precharge time, 

CAS high time and 

CAS low time 

Common dual access parameters used for Port 
B accesses and inputs and outputs used only in 
dual accessing 


200-212 Refresh parameters 


100-121 
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300-315 Mode 0 access parameters used in both single 


and dual access applications 

Mode 1 access parameters used in both single 
and dual access applications 

Special Mode 1 access parameters which super- 
sede the 400-416 parameters when dual ac- 
cessing 

500-506 Programming parameters 

600-605 Graphics parameters for VRAM transfer cycles 
Unless otherwise stated Voc = 5.0V +10%, 0 < Ta < 
70°C, the output load capacitance is typical for 4 banks of 
18 VRAMs per bank, including trace capacitance (see Note 
2). 

Two different loads are specified: 

C. = 50 pF loads on all outputs except 

C. = 150 pF loads on Q0-8, 9, and 10; or 


400-416 


450-455 


Cy = 50 pF loads on ail outputs except 
Cy = 125 pF loads on RASO-3 and CAS0-3 and 
Cy = 380 pF loads on Q0-8, 9, and 10. 
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15.0 AC Timing Parameters: DP8520A/DP8521A/DP8522A (Continued) 


| eer 
Number Symbol Common Parameter 


1 fCLK CLK Frequency hair 
2 tCLKP CLK Period | 50. | 
3,4 |tCLKPW | CLK Pulse Width 


5 _—| fDCLK DELCLK Frequency | 5 


6 tDCLKP DELCLK Period 


7,8 | tDCLKPW | DELCLK Pulse Width 


9a tPRASCASO | RAS Asserted to CAS Asserted 
((RAH = 15 ns, tASC = Ons) 
9b tPRASCAS1 | RAS Asserted to CAS Asserted 
(t(RAH = 15 ns, tASC = 10 ns) 


9c tPRASCAS2 | (RAS Asserted to CAS Asserted 40 
(tRAH = 25ns, tASC = Ons) 
50 


(tRAH = 25 ns, tASC = 10 ns) 
toa |tRAH__—| RowAddressHold Time (tRAH = 15) | 15 | | 15 || 15 | 
1b |tRAH || RowAddressHoldTime(tRaH = 25) | 25 | | 25 | | 25 | | 28 
11a_|tasc___—[ CoumnAddressSetupTimetasc=0) | o | | o | | o | | 0 
1h __|tasc___—[ Column Address Setup Timectasc=10)| to | | 10 | | to | | 10 


cl A 
following Precharge 
13__|tPARQRAS | AREGNegatedtoRASNegated |_| a8 | | 43 | | at | 
14 | PENCL__| ECASO-tAssortedtoTASAsserted | | 23 | | at | | 20 | | 
18 __|#PENCH  [ECAS0-1NegatedtoCASNegated | | 25 | | a3 | | 20 | 

PARQCAS | AREGNegatedtoCASNegated |_| eo | | e8 | | a7 | 
17___|torkwi |cuktoWaAITNegted | | oo | | co || 


18 tPCLKDLO | CLK to DTACK Asserted 
| (Programmed as DTACK of 1/2, 1, 14 
or if WAITIN is Asserted) 
19 tPEWL ECAS Negated to WAIT Asserted 40 40 34 
during a Burst Access 
20 tSECK ECAS Asserted Setup to CLK High to 
24 19 19 


_ 
o 


Recognize the Rising Edge of CLK 
during a Burst Access : 
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8520A/21A/22A-20 8520A/21A/22A-25 
Description tame be Ci 


@ 


4 


15.0 AC Timing Parameters: DP8520A/DP8521A/DP8522A (continued) 
Unless otherwise stated Voco = 5.0V + 10%, 0°C < Ta < 70°C, the output load capacitance is typical for 4 banks of 18 VRAMs 
per bank, including trace capacitance (see Note 2). 


Two different loads are specified: Cy = 50 pF loads on all outputs except 
Ci. = 50 pF loads on all outputs except Cy = 125 pF loads on RASO-3 and CAS0-3 and 
Cy. = 150 pF loads on QO-8, 9 and 10; or CH = 380 pF loads on QO-8, 9 and 10. - 


8520A/21A/22A-20 8520A/21A/22A-25 
Common Parameter 
Description 


Number 


, 
rm 


oO 
= 


ECAS Asserted to DTACK 


21 tPEDL 


V2eS8d0/VlcS8dd0/Vv0cS8dd 


48 


® 
@ 


Asserted during a Burst Access 
(Programmed as DTACKO) 


22 ~—«| tPEDH ECAS Negated to DTACK 
Negated during a Burst Access 
23 tSWCK WAITIN Asserted Setup to CLK 


26 tPAQ Row, Column Address Valid to 
QO0-8, 9, 10 Valid 


27 tPCINCQ COLINC Asserted to Q0-8, 9, 10 
incremented 


28 tSCINEN COLINC Asserted Setup to ECAS 
Asserted to Ensure taSC = Ons 


29a tSARQCK1 | AREQ Negated Setup to CLK 
High with 1 Period of Precharge 
29b tSARQCK2 | AREQ Negated Setup to CLK High with 
> 1 Period of Precharge Programmed 
30 tPAREQDH | AREQ Negated to DTACK Negated 


31 tPCKCAS | CLK High to CAS Asserted 
when Delayed by WIN 


® 
© 


G 
Oo 


=| 


Go 
ico) 


17 


1 


—_ 
N 


BAS 


{ee 
Eg 


on 


27 
32 


32 tSCADEN | Column Address Setup to ECAS 
Asserted to Guarantee tASC = 0 


| 33 tWCINC COLINC Pulse Width 


34a tPCKCLO CLK High to CAS Asserted following 
Precharge (tRAH = 15 ns, tASC = Ons) 


34b tPCKCL1 CLK High to CAS Asserted following 
Precharge (tRAH = 25 ns, tASC = Ons) 


34c tPCKCL2 CLK High to CAS Asserted following 
Precharge (tRAH = 25 ns, tASC = Ons) 


34d tPCKCL3 CLK High to CAS Asserted following 
Precharge (tRAH = 25 ns, tASC = 10 ns) 


35 tCAH Column Address Hold Time | 
| (Interleave Mode Only) 
36 tPCQR CAS Asserted to Row Address 
Valid (Interleave Mode Only) 
37 tPCASDTH | CAS Asserted to DT/OE 
Asserted for a VRAM Read Access 
38 tPCASDTL | CAS Negated to DT/OE 
Negated for a VRAM Read Access 


5 
15 
4 


1 


— 
o 


79 


48 
89 
1 


89 


99 


90 


24 


24 


ide) © 
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15.0 AC Timing Parameters: DP8520A/DP8521A/DP8522A (Continued) . 
Unless otherwise stated Vocg = 5.0V +10%, 0°C < Ta < 70°C, the output load capacianee is typical for 4 banks of 18 VRAMs 
per bank, including trace capacitance (see Note 2). 


Two different loads are specified: . Cy = 50 pF ious: on all outputs exeant | 
C._ = 50 pF loads on all outputs except | | Cy = 125 pF loads on RASO-3 and CASO-3 and 
C._ = 150 pF loads on QO-8, 9 and 10; or e3 Cy = 380 pF loads on Q0-8, 9 and 10. 


8520A/21A/22A-20 8520A/21A/22A-25 


Difference 
Parameter Description 


Number 


50 tD1 (AREQ or AREQGB Negated to RAS 


Negated) Minus (CLK High to RAS 
Asserted) 


51 (CLK High to Refresh RAS Negated) | 
Minus (CLK High to RAS Asserted) 
52 tD3a_ | (ADS Asserted to RAS Asserted 

(Mode 1)) Minus (AREQ Negated 

to RAS Negated) 


-S 


5 


53 tD3b | (CLK High to RAS Asserted (Mode 0)) 
Minus (AREQ Negated to RAS Negated) 

54 (ECAS Asserted to CAS Asserted) 
Minus (ECAS Negated to CAS Negated) 

55 (CLK to Refresh RAS Asserted) Minus 
(CLK to Refresh RAS Negated) 


56 (AREQ Negated to RAS Negated) 


| 
“N 
N 


awh 
mb i 


oO 


Minus (ADS Asserted to RAS 
Asserted ((Mode 1)) 
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15.0 AC Timing Parameters: DP8520A/DP8521A/DP8522A (continuea) 
Unless otherwise stated Vcc = 5.0V + 10%, 0°C < Ta < 70°C, the output load capacitance is typical for 4 banks of 18 VRAMs 
per bank, including trace capacitance (see Note 2). 


O 
~ 
co 
v1 
NO 
© 
> 
Two different loads are specified: Cy = 50 pF loads on all outputs except G 
C. = 50 pF loads on all outputs except Cy = 125 pF loads on RASO-—3 and CASO-3 and ~ 
Ci = 150 pF loads on Q0-8, 9 and 10; or Cy = 380 pF loads on Q0-8, 9 and 10. = 
nh 
8520A/21A/22A-20 8520A/21A/22A-25 > 
Number paranjalar Deectionon re ee = 
[Min | Max | Min | Max | Min | Max | Min | Max | & 
100 |tHCKARQB |AREQGBNegatedHeldfromCLKHigh | 3 | | 3 | Ja | fsa N 
101 | tSARQBCK |AREQBAssertedSetuptocLKHigh | 8 | | 8 | |7 | [74 > 
102 | tPAQBRASL_ | AREGB Asserted to RAS Asserted | 87 | ae 0 | |g 
103 | tPAQBRASH_ | AREGB Negated to RAS Negated | | a7 | | 42] | 2 | | 33 
105 tPCKRASG CLK High to RAS Asserted for 48 42 
Pending Port B Access 
106 | tPAQBATKBL | AREQBAssertedtoATACKBAsserted | | 48 | | 48 | | 38 | | 38 
107 tPCKATKB CLK High to ATACKB Asserted 47 47 
for Pending Access 
108 tPCKGH CLK High to GRANTB Asserted | | 3a | | 38 | 80 «| | 80 
109 | tPCKGL CLK High to GRANTB Negated | | a2 | 32 || ee || 
110 tSADDCKG Row Address Setup to CLK High That 
Asserts RAS following a GRANTB 15 11 16 
_ | Change to Ensure tASR = Ons for Port B 
111 tSLOCKCK LOCK Asserted Setup to CLK Low 
to Lock Current Port | 
112 tPAQATKBH | AREGNegatedtoATACKBNegated | | 26 | | 26 | | at {| 21 
113 | tPAQBCASH | AREGB Negated to CAS Negated ee ea. 
114 tSADAQB Address Valid Setup to 7 11 7 12 
AREQB Asserted | 
116 |tHCKARQG |AREGNegatedHeldfromCLKHigh | 3 | | 3 | [3 |] [3] 
117 tWAQB AREQB High Pulse Width 34 31 
to Guarantee tASR = Ons : 
118a tPAQBCASO | AREQB Asserted to CAS Asserted 88 
(tRAH = 15 ns, tASC = Ons) 
118b tPAQBCAS1 | AREQB Asserted to CAS Asserted 113 91 98 
(tRAH = 15 ns, tASC = 10 ns) 
118¢c tPAQBCAS2 | AREQB Asserted to CAS Asserted 413 91 98 
(tRAH = 25 ns, tASC = Ons) 
118d tPAQBCAS3 |} AREQB Asserted to CAS Asserted 
120a tPCKCASGO | CLK High to CAS Asserted 
for Pending Port B Access 101 93 
(tRAH = 15 ns, tASC = Ons) 
120b tPCKCASG1 | CLK High to CAS Asserted 
for Pending Port B Access 111 103 
(tRAH = 15 ns, tASC = Ons) 
120c tPCKCASG2 | CLK High to CAS Asserted 
for Pending Port B Access 111 103 
(tRAH = 25 ns, tASC = Ons) 
120d tPCKCASG3_ | CLK High to CAS Asserted 
for Pending Port B Access 106 113 
(tRAH = 25 ns, tASC = 10 ns) 
121 tSBADDCKG | Bank Address Valid Setup to CLK 
High That Starts RAS 10 
for Pending Port B Access 
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15.0 AC Timing Parameters: DP8520A/DP8521A/DP8522A (continued 
Unless otherwise stated Voc = 5.0V +10%, 0°C < Ta < 70°C, the output load capac 
per bank, including trace capacitance (see Note 2). 


Two different loads are specified: | 
Ci = 50 pF loads on all outputs except 


C_ = 150 pF loads on Q0-8, 9 and 10; or 


Number 


200 
201 
202 
204 
205 
206 
207 
208 
209a 


209b 


209c 


209d 


210 
211 
212 


itance is typical for 4 banks of 1 


50 pF loads on all outputs except 
125 pF loads on RASO-3 and CASO-3 and 
380 pF loads on Q0-8, 9 and 10. 


8520A/21A/22A-20 


| . 8520A/21A/22A-25 
-Refresh Parameter 


CLK High to CAS Asserted 
during Error Scrubbing 
(tRAH = 15 ns, tASC = Ons) 


, 


= 
io 
> 


_—s 


3|F 


=—h 


= 
ao; @ 
Nh 


Ol. oro ims 


CLK High to CAS Asserted 


during Error Scrubbing 
(tRAH = 15ns, tASC = 10 ns) 


CLK High to CAS Asserted 
during Error Scrubbing 


ao; > 
QO | oO ine) 


(tRAH = 25 ns, tASC = Ons) 


CLK High to CAS Asserted 
during Error Scrubbing 
((RAH = 25 ns, tASC = 10 ns) 


AFSH Pulse Width | | 
CLKHightoRFRGAsserted | | 
CLKHightoRFRGNegated | | 50 | 


—_h 


nm jaja 
ro) 
x 


on 


aa 
oO 
= 


io) 


8 VRAMs 


34 
50 
51 
33 
27 


80 


90 


90 


100 


15.0 AC Timing Parameters: DP8520A/DP8521A/DP8522A (continued) 
Unless otherwise stated Voc = 5.0V +10%, 0°C < Ta < 70°C, the output load capacitance is typical for 4 banks of 18 VRAMs 
per bank, including trace capacitance (see Note 2). 


Two different loads are specified: Cy = 50 pF loads on all outputs except 
-C_ = 50 pF loads on all outputs except Cy = 125 pF loads on RASO-3 and CASO-3 and 
C_ = 150 pF loads on Q0-8, 9 and 10; or Cy = 380 pF loads on Q0-8, 9 and 10. 


8520A/21A/22A-20 8520A/21A/22A-25 


Number Parameter Description CL CL CH 


goo__[tscsck | CSAssertedtocikHign | 14 | | 14 | 3 | 18 | 


301a tSALECKNL | ALE Asserted Setup to CLK High 

Not Using On-Chip Latches or 

if Using On-Chip Latches and 

BO, B1, Are Constant, Only 1 Bank 

301b° | tSALECKL | ALE Asserted Setup to CLK High, pel fa) fal jaw 


if Using On-Chip Latches if BO, B1 
Can Change, More Than One Bank 
303 | tSBADDCK | Bank Address ValidSetuptoCLKHigh | 20 | | 20] | 18 | | 18 


VecS8d0/Vl2S8dd/V0cS8dd 


304 tSADDCK Row, Column Valid Setup to 
CLK High to Guarantee 11 11 16 
tASR = Ons 
305 tHASRCB Row, Column, Bank Address 
Held from ALE Negated 10 10 
(Using On-Chip Latches) 
306 tSRCBAS Row, Column, Bank Address 


Setup to ALE Negated 
(Using On-Chip Latches) 


so7[POKRL | GLKHigh toRAS Assored «| +i ay | | | [|| 2 


308a tPCKCLO CLK High to CAS Asserted 70 79 
(tRAH = 15 ns, tASC = Ons) 

308b tPCKCL1 CLK High to CAS Asserted 89 
(tRAH = 15 ns, tASC = 10 ns) 

308c tPCKCL2 CLK High to CAS Asserted 89 
(tRAH = 25 ns, tASC = Ons) 

308d tPCKCL3 CLK High to CAS Asserted 99 
(tRAH = 25 ns, tASC = 10 ze) 


309 
: =i eee 
to Guarantee CAS is peepee 
311 22 
312 22 


313 tPCLKDL1 CLK High to DTACK Asserted 44 At 33 
(Programmed as DTACKO) 

314 tPALEWL ALE Asserted to WAIT Asserted 48 48 39 
(CS is Already Asserted) 


AREQ Negated to CLK High That Starts 
Access RAS to Guarantee tASR = Ons 
(Non-Interleaved Mode Only) 


315 
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15.0 AC Timing Parameters: DP8520A/DP8521A/DP8522A (Continued) 
Unless otherwise stated Voc = 5.0V +10%, 0°C < Ty < 70°C, the output load capacitance is typical for 4. peak of 18 VRAMs 
per bank, including trace capacitance (see Note 2). 


Two different loads are specified: Cy = 50 pF loads on all outputs except _ 
Ci = 50 pF loads on all outputs except Cy = 125 pF loads on RASO-3 and CASO-3 and 
Cy. = 150 pF loads on Q0-8, 9 and 10; or Cy = 380 pF loads on Q0-8, 9 and 10.. 


: | 8520A/21A/22A-20 8520A/21A/22A-25 
Number ) Parameter Description CL . : CL _CH 


mae [Min | Max | Min | Max 
sooa | GADSOKI |ADSAssoredSoupwocicrigh | e| |e] [7] |7_ 


400b tSADSCKW | ADS Asserted Setup to CLK | | 
(to Guarantee Correct WAIT 31 | 31 
or DTACK Output; Doesn’t Apply for DTACKO) . . 


401 pS eee ee 


402 I tPADSRL | ADS Asserted to |ADS AssertedtoRASAsserted Asserted aes a = 29 
403a tPADSCLO | ADS Asserted to CAS Asserted 80 
(tRAH = 15 ns, tASC = Ons) 


403b tPADSCL1 | ADS Asserted to CAS Asserted 404 92 
(tRAH = 15 ns, tASC = 10 ns). 

403c tPADSCL2 | ADS Asserted to CAS Asserted 404 
(tRAH = 25ns, tASC = 10 ns) . | 

403d tPADSCL3_ | ADS Asserted to CAS Asserted 

ii ‘aoa (RAH = 25 ns, tASC = 10 ns) lc eh es 
tSADDADS | Row Address Valid Setup to ADS 43 

Asserted to Guarantee tASR = Ons 


i<e) 
Le) 


|} 405 tHCKADS | ADS Negated Held from CLK High 


406 | tSWADS | WAITINAssertedSetuptoADS | 
| Asserted to Guarantee DTACKO 
IsDelayed 
407 | tSBADAS | Bank Address Setup to ADS Asserted Ee aa | ate 
408 tHASRCB_ | Row, Column, Bank Address Held from 40 
ADS Asserted (Using On-Chip Latches) 
409 tSRCBAS __| Row, Column, Bank Address Setup to 
ADS Asserted oie er aaa On-Chip Latches) 


410 | twADSH_ | ADS [ADS Negated Pulse width Pulse Width fa2 | | tw] [aw] | 


411 5 


3 
412 
(to Guarantee CAS Delayed during 
Writes Accesses) 
413 tPADSWLO | ADS Asserted to WAIT Asserted 46 46 37 37 
(Programmed as WAITO, Delayed Access) 


es 


414 “PADSWL1 ADS Asserted to WAIT Asserted 38 31 3 
(Programmed WAIT 1/2 or 1) 
415 tPCLKDL1 | CLK High to DTACK Asserted 
(Programmed as DTACKO, 41 41 
Delayed Access) - | | | 
416 AREQ Negated to ADS Asserted 
to Guarantee tASR = Ons 42 | 31 
(Non Interleaved Mode Only) 
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15.0 AC Timing Parameters: DP8520A/DP8521A/DP8522A (continueq) 
Unless otherwise stated Voc = 5.0V +10%, 0°C < Ta < 70°C, the output load capacitance is typical for 4 banks of 18 VRAMs 
per bank, including trace capacitance (see Note 2). 


Two different loads are specified: Cy = 50 pF loads on all outputs except 
C. = 50 pF loads on all outputs except Cy = 125 pF loads on RASO-3 and CASO-3 and 
CL = 150 pF loads on Q0-8, 9 and 10; or Cy = 380 pF loads on Q0-8, 9 and 10. 


8520A/21A/22A-20 | 8520A/21A/22A-25 


Mode 1 Dual Access 
Parameter Description 


Row Address Setup to CLK High That 


Asserts RAS following a GRANTB 


450 tSADDCKG 
Port Change to Ensure taSR = Ons 
451 tPCKRASG CLK High to RAS Asserted 
for Pending Access 
452 tPCLKDL2 CLK to DTACK Asserted for Delayed 
Accesses (Programmed as DTACKO) 


111 119 


453a tPCKCASGO | CLK High to CAS Asserted 
for Pending Access 
(tRAH = 15 ns, tASC = Ons) 
453b tPCKCASG1 | CLK High to CAS Asserted 
for Pending Access 
(tRAH = 15 ns, tASC = 10 ns) 
453c tPCKCASG2 | CLK High to CAS Asserted 
for Pending Access 
(tRAH = 25 ns, tASC = Ons) 
453d tPCKCASG3 | CLK High to CAS Asserted 
for Pending Access 
(tRAH = 25ns, tASC = 10 ns) 
454 tSBADDCKG | Bank Address Valid Setp to CLK High 
That Asserts RAS for Pending Access 
455 tSADSCKO ADS Asserted Setup to CLK High 


113 


ae 
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15.0 AC Timing Parameters: DP8520A/DP8521A/DP8522A (continued) 
Unless otherwise stated Voc = 5.0V +10%, 0°C < Ta < 70°C, the output load capacitance is typical for 4 banks of 18 VRAMs 
per bank, including trace capacitance (see Note 2). 


Two different loads are specified: ey , Cy 
C. = 50 pF loads on all outputs except — ah 
C. = 150 pF loads on Q0-8, 9 and 10; or 


50 pF loads on all outputs except _ 
125 pF loads on RASO-3 and CAS0O-3 and 
380 pF loads on Q0-8, 9 and 10.: 


mo | #8 8520A/21A/22A-20 8520A/21A/22A-25 
Number Susiceatteeaaien Pom | mw | mw | 
500 tHMLADD | Mode Address Held from ML Negated cae eae rams 
501 {SADDML_| Mode Address SetuptoMLNegated | 6 | | 6. 
502 | tWML__| ML Pulse Width ce ee eee 
503 tSADAQML | Mode Address Setup to AREQ Asserted Po | fof | + 
504 tHADAQML | Mode Address Held from AREQ Asserted fae. | | 39 | | 20 | 
505 tSCSARQ CS Asserted Setup to 

AREQ Asserted 


506 Fo ae ee ee Se 


tll 
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15.0 AC Timing Parameters: DP8520A/DP8521A/DP8522A (continued) 
Unless otherwise stated Vcc = 5.0V +10%, 0°C < Ta < 70°C, the output load capacitance is typical for 4 banks of 18 VRAMs 
per bank, including trace capacitance (see Note 2). 


oO 
mv | 
oo 
oi 
NO 
© 
> 
Two different loads are specified: Cy = 50 pF loads on all outputs except Go 
C. = 50 pF loads on all outputs except Cy = 125 pF loads on RASO-3 and CASO-3 and 0 
Ci = 150 pF loads on QO-8, 9 and 10; or Cy = 380 pF loads on QO-8, 9 and 10. = 
Nh 
8520A/21A/22A-20 | 8520A/21A/22A-25 > 
Number he seat Pom | mw | a | ow |S 
Parameter Description ry ~o 
© 
| min | Max | min | Max | Min | Max | Min | Max | $ 
600 tSCKVSRL_ | VSRL Low Setup to CLK Rising Edge to / Re 
guarantee counting VSRL as being Low 7 7 > 
(used to determine when to end Graphics 
Shift load access) . : 
601 tHVSRLCK | VSRL Low from CLK High (to guarantee | 
VSRL is not counted as being Low until 4 
the next rising clock edge) 
602 tSCKAVSRL | AVSRLRQ Low before CLK Rising Edge to 
guarantee locking the VRAM to only Port 13 13 12 12 
A accesses 
603 tPVSRLDTL | VSRL Low to DT/OE Low during Graphics 
33 
Shift Register Load access 
604 | tPVSRLDTH | VSRL Negated to DT/OE Negated | ff aa || a || ce | |e 
605 | tPCKOTL__| OLK to DT/OE Negated | two] {ofl f{a| |» 


Note 1: “Absolute Maximum Ratings” are the values beyond which the safety of the device cannot be guaranteed. They are not meant to imply that the device 
should be operated at these limits. The table of ‘Electrical Characteristics” provides conditions for actual device operation. 


Note 2: Input pulse OV to 3V; tR = tF = 2.5 ns. Input reference point on AC measurements is 1.5V. Output reference points are 2.4V for High and 0.8V for Low. 
Note 3: AC Production testing is done at 50 pF. 
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15.0 AC Timing Parameters: DP8520A/DP8521A/DP8522A (Continued) 


6 


DELCLK 


TL/F/9338-B5 


FIGURE 60. Clock, DELCLK Timing 
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AREQB | - aa 
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ATACKB 
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RASn 
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RO=8,9,10 


S888 KXXKXXRN | |e | XRXXM [RRR 
LOCK 


CASn 


ECASn 


TL/F/9338~B6 


FIGURE 61. 100: Dual Access Port B 
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15.0 AC Timing Parameters: DP8520A/DP8521A/DP8522A (continueq) 


a te 
AREQB 


ATACKB 
RAS 
GRANTB 
ADS 
AREQ 
DTACK 


RAS 


CLK 


AREQ 


RFIP 


Q0-8,9,10 


RASn 


DISRFSH 
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EXTENDRF 


TL/F/9338-B7 
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= 


201 


FIGURE 63. 200: Refresh Timing 


1-251 


202 


ee es i 


TL/F/9338-B8 


V2eS8d0/VWl2S8d0/v0ceS8dd 


DP8520A/DP8521A/DP8522A 


15.0 AC Timing Parameters: DP8520A/DP8521A/DP8522A (continued) 


CLK 


ALE i 


aK) — 


m0 meet __ a ae 


Bl 


co-88.10 2 (ont OY Po 


al: ae 
— 


AREQ 


DTACK 


at 
w-s8.10 XXXXKX ae XXX m= 


FIGURE 64. 300: Mode 0 Timing 
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15.0 AC Timing Parameters: DP8520A/DP8521A/DP8522A (continued) 


CLK 
309 
301b 
i ‘ 
308 36 
oa om, en 


le ee Tay 
av8830 XXXXXXKK eX comm vor Xm YX 


E 
VeeS8d0/VicS8dd/V0cS8dG 


Q 
t 
~ 
e 
= 
=z 
7 
vat 
Zz 
So 
= 


307 . 
RAS1 ; : 
CASi : 


RAS2 


303 


XXX 


UXAXKXXXXAY KXXXXXXXXXXKX 


™ XXXXXXK 
3 


OOK - 


al 


XKKKXKXKK KKKKXKX 


TL/F/9338-CO 


(Programmed as C4 = 1,C5 = 1, C6 = 1) 
FIGURE 65. 300: Mode 0 Interleaving 
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15.0 AC Timing Parameters: DP8520A/DP8521A/DP8522A (Continued) 


RO~8,9,10 
C0=8,9,10 
BO, B1 


412 


WIN 


WAITIN. 


TL/F/9338-—C1 


_ FIGURE 66. 400: Mode 1 Timing 
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15.0 AC Timing Parameters: DP8520A/DP8521A/DP8522A (continued) 
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FIGURE 67. 400: COLINC Page/Static Column Access Timing 


al 
n 
bb ~< 


5 


TL/F/9338-C2 


CLK 


502 
500 


RO=9 


core OX OXY KKK 


ADS : | 
Ot KXXXXXKAKKKK KK EKA XK KKK KKK KKK 
503 _ 
ia 


OOOO XX OOK 


FIGURE 68. 500: Programming 
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15.0 AC Timing Parameters: DP8520A/21A/22A (continued) 
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FIGURE 69. Graphics Timing Diagram 
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16.0 Functional Differences 
between the DP8520A/21A/22A 
and the DP8520/21/22 


1. Extending the Column Address Strobe (CAS) 


CAS can be extended indefinitely after AREQ transitions 
high in non-interleaved mode only, providing that the user 
program the DP8520A/21A/22A with the ECASO (negat- 
ed) during programming. To extend CAS, the user contin- 
ues to assert any or multiple ECASs after negating 
AREQ. Extending CAS with RAS negated can be used to 
gain RAS precharge time. By negating AREQ, RAS will be 
negated. The user can then continue to assert a one or 
both of the ECASs, which will keep CAS asserted. By 
keeping CAS asserted with RAS negated, the VRAM will 
keep the data valid until CAS is negated. Even though 
CAS will be extended, DTACK output will always end 
from AREQ negated. 


2. Extending DT/OE Functionality 


The DT/OE output will follow the CAS output during a 
VRAM read access, and will remain negated during a 
VRAM write access. For the DP8520/21/22, the DT/OE 
output remained negated for all VRAM access cycles. 
This will allow the VRAM to drive the data bus. There are 
2 options for the function of the DT/OE output during a 
video shift register load operation. With ECASO negated 
during programming, the DT/OE output will follow the 
VSRL input during video shift register load operations. 
With the ECASO asserted during programming, VSRL will 
assert DT/OE. VSRL negated before four rising clock 
edges will cause DT/OE to be negated. VSRL asserted 
more than four rising clock edges will cause DT/OE to be 
negated from the fourth rising clock edge. 
3. Dual Accessing 

RAS will be asserted either one or two clock periods after 
GRANTB has been asserted. The amount of RAS low 
and high time, programmed by bits RO and R1, deter- 
mines the number of clock periods after GRANTB chang- 
es before RAS will start. This is shown in the table below. 


RAS Asserted 
During 
Refresh 


RAS Asserted from 
GRANTB Change 


1 Rising Clock Edge 
1 Rising Clock Edge 
2 Rising Clock Edges 
2 Rising Clock Edges 


& 


. Refresh Ciock Counter 


The refresh clock counter will count and assert RFRQ 
externally when it is time to do a refresh. This will occur 
even when internal refreshes are disabled. This allows 
the user to run the chip in a request/acknowledge mode 
for refreshing. ECASO is used to program the RFIP output 
to act as either refresh request (RFRQ) or RFIP. ECASO 
asserted during programming causes the RFIP output to 
function as RFIP. ECASO negated during programming 
causes the RFIP output to function as RFRQ. 


. Clearing the Refresh Clock 


The refresh clock counter is cleared by negating 
DISRFSH and asserting RFSH for at least 500 ns. 


a 
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17.0 DP8520A/21A/22A User Hints 


1. All inputs to the DP8520A/21A/22A should be tied high, 
low or the output of some other device. 

Note: One signal is active high. COLINC (EXTNDRF) should be tied low 
to disable. 

2. Each ground on the DP8520A/21A/22A must be decou- 
pled to the closest on-chip supply (Vcc) with 0.1 uF ce- 
ramic capacitor. This is necessary because these 
grounds are kept separate inside the DP8520A/21A/ 
22A. The decoupling capacitors should be placed as 
close as possible with short leads to the ground and sup- 
ply pins of the DP8520A/21A/22A. 


3. The output called “CAP” should have a 0.1 F capacitor 
to ground. 


. The DP8520A/21A/22A has 200 series damping resis- 
tors built into the output drivers of RAS, CAS, address 
and DT/OE. Space should be provided for external 
damping resistors on the printed circuit board (or wire- 
wrap board) because they may be needed. The value of 
these damping resistors (if needed) will vary depending 
upon the output, the capacitance of the load, and the 
characteristics of the trace as well as the routing of the 
trace. The value of the damping resistor also may vary 
between the wire-wrap board and the printed circuit 
board. To determine the value of the series damping re- 
sistor it is recommended to use an oscilloscope and look 
at the furthest VRAM from the DP8520A/21A/22A. The 
undershoot of RAS, CAS, DT/OE and the addresses 
should be kept to less than 0.5V below ground by varying 
the value of the damping resistor. The damping resistors 
should be placed as close as possible with short leads to 
the driver outputs of the DP8520A/21A/22A. 


5. The circuit board must have a good Vcc and ground 
plane connection. If the board is wire-wrapped, the Voc 
and ground pins of the DP8520A/21A/22A, the VRAM 
associated logic and buffer circuitry must be soldered to 
the Voc and ground planes. 


- = 


6. The traces from the DP8520A/21A/22A to the VRAM - 


should be as short as possible. 


. ECASO should be held low during programming if the user 
wishes that the DP8520A/21A/22A be compatible with a 
DP8520/21/22 design. 


18.0 Description of a DP8522A/ 
DP8500 System Interface 


Several simple block and timing diagrams are inserted to 
help the user design a system interface between the 
DP8520A/21A/22A VRAM controller and the Raster Graph- 
ics Processor DP8500 (as shown in Figure 70). For access- 
ing the VRAM, the DP8520A/21A/22A uses the RGP’s 
PHI 2 clock as an input clock and it runs in Mode 1 (asyn- 
chronous mode). This allows the user to guarantee row, col- 
umn and bank address set up times to a rising clock edge 
(as shown in timing calculations provided). This system de- 
sign uses a PAL® to interface the access request logic and 
the wait logic between the DP8522A and the RGP. External 
logic is also needed for plane control. 


N 
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18.0 Description of a DP8522A/DP8500 System Interface (Continued) 


DECODER 


ADDRESS | 


The main idea of the block diagram in Figure 73 is to cause 
the video DRAM shift register load operation to happen cor- 
rectly. Once the DP8500 (RGP) issues the Display Refresh 
REQuest signal (DRREQ) the system knows that the video 
shift register load operation should occur at a certain de- 
fined time later. In the block diagram this time is controlled 
by the counter device. This counter determines when VSRL 
transitions high, thereby causing the video shift register load 
operation. —— 


in the upper part of the block diagram is the “REFRESH” 
output that is used as the “VSRL” input of the DP8522A 
and is also used to create “REFRESH NOT DONE”. To 
creat the “REFRESH” output a NAND latch is used. This 
latch is set when a screen refresh is in progress, shown by 
the RGP outputs ALE, BO, and B1 all being high. If the 
status of the RGP is anything other than screen refresh the 
latch is reset. The latch is also reset during a screen refresh 
when the load shift register counter times out. This counter 
determines when the “VSRL” input of the DP8522A tran- 
sitions high, causing the video DRAMs to load a row of data 
into their shift registers. 


The “REFRESH” signal along with the load shift register 
counter output “NOTDONE” are used to create the 
“REFRESH NOT DONE” signal. This output is used for two 
purposes. One of which is to hold the “WAIT” output low, 
thereby inserting WAIT states into the RGP video shift regis- 
ter load access. The other purpose is to hold “DT/OE” low 
until “VSRL” transitions high. . | 


ADS, AREQ 


Te!" 


CAS PLANE CONTROL 
AND ENABLE LOGIC 


FIGURE 70. DP8422A/DP8500 (RGP) Interface Block Diagram 
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Important setup timing parameters which must be met for a 
DP8500(RGP)-—DP8522A system (assuming RGP is running 
at 20 MHz (Tcp = 50 ns)). 


1. Address Setup to ADS Asserted 


= 1Tcp — #ta_y + Derating the DP8500 Spec for Light 
Load — tpr373 + Min PAL Delay to Produce ADS & 
AREQ | | 


= 50ns — 38ns + 5ns — 8ns + 2ns 
= 11ns | 


_ (Using Light Load Timing Specs, the DP8520A/21A/22A 
Needs) 


9 ns Setup for Row Address to ADS Asserted 

11 ns Setup for Bank Address to ADS Asserted | 
2. ADS Setup to Clock Rising Edge 

= 1Tcp — #tatey — Max PAL Delay 

= 50 ns — 26 ns — 10 ns 

= 14ns 

~ (DP8520A/21A/22A Needs 7 ns) 
3. WAIT Negated Setup to Clock 

= 1Tcp — $18 — Max PAL Delay 

= 50 ns — 28ns — 10ns 

= 12ns 

(The DP8500 Needs 5 ns Setup Time) 

Note 1: “$” symbol refers to a DP8520A/21A/22A timing parameter. 
Note 2: ‘“#” symbol refers to a DP8500 timing parameter. 


Note 3: ALE asserted by the RGP (DP8500) should use the system PAL to 
assert WAIT in order to guarantee proper setup time. DTACK low should 
then be used to negate the WAIT signal through PAL equations. 


18.0 Description of a DP8522A/DP8500 System Interface (continued) 
T1 T2 | T2(W)| T3 T1 

PHI 1 

(© 20 MHz) 


PHI 2 
(8522A CLK) 


ADDRESS ® Sibonre f 


oO | 
BS 1-0 a ae ae 
f 


cis 


ee van x 


ips 


DTACK \ 
a bee 


WR 


> 
w 
me 
<a 
EE Geen 


a 

¢ 
a dak 
a 
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FIGURE 71. DP8522A/DP8500 (RGP) 
Instruction Read Cycle Timing 
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18.0 Description of aap parin DP8500 System — (Continued). 


PHI 1 


PHI 2 
(DP8522A CLK) 


ALE 


ADS 


" 


AREQ 
appRess X 
BS (1=0) 
RAS 
CAS 
DTACK % 


(from DP8522A) 


WAIT 
(FOM PAL®) 


WR 


a 


ALE 
BSO 
BS1 


HSYNC 


SHIFT CK 


DRREQ 


ALE 


BSO 
BSi~—= 


f, 
_ 
: ae 


stampa 9 
= an eo 


arta 
aia 
noe 


a ( ADDRESS VALIO ) Cs is XXRERRKE EXE RXR ERIE 
Sea Se angels 
>, ee 


T2(W) 


"STATUS VALID (00) 


T1 


FIGURE 72. DP8500 (20 MHz)/DP8522A 
Write Operand Cycle Timing 


WAIT (to the DP8500) 


DP8520A 


AREQ 


FIGURE 73. Mid-Scan Line Load Application Example 
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FIGURE 74. Mid-Scan Line Load Functional Timing 
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National © 
Semiconductor 


DP8530 Clock Generator 


|General Description | Features | 
The DP8530 is a clock generator intended for use in medi- ™ On-chip crystal oscillator and phase-locked-loop gener- 
um-performance CRT graphics systems. The device gener- ate TTL and ECL PCLK and LCLK outputs 


ates both ECL and TTL pixel and load clocks from a single m™ 125 MHz ECL differential output pixel clock (PCLK) 
crystal resonator using digital phase locked loop (PLL) tech- © ™ Gated TTL and ECL load clock (LCLK) outputs 


niques. The L counter inputs allow the pixel clock to be =m pixel clock to load ‘clock divide ratios from 4 to 32 in 
divided by 4 to 32 in increments of 4 to drive the LCLK. The —_increments of 4 


S counter inputs allow the system clock (XOUT) to run up to 
four times the LCLK. Both free-running and gated (by ENIN) 
LCLK outputs are available. 


Block Diagram numbers in parentheses indicate pin numbers 


XTLB (13) O=——> 
XTLC (14) O=—> 


XOUT (17) Om 


CPO (10) 


iS ——O OPAMP (11) 
an LO=L2 (9,8, 12) 


| : © VCO! (6) 
d L | 
CNTR 0 VC02(7) 


VCC0(21) O——— 


2 ; PH DET | 
SO, $1 (16, 15) O-> CNTR ers 


GNDO (20) O——= | 
VCC1 (18) O—— LCLK TIMING 
GND1 (19) =—— AND CONTROL 
GND2 (5, 22) C= iV \/ | VA \/ 
VCC3(3)O———-_—s«a3H , oe J 


UCLK2 LCLK2 LCLK1 ENIN PCLK2 PCLK2 PCLKt 
(27) (28) (25) (26) (1) (2) = (23) 


TL/F/9328—1 
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Connection Diagram 


N 
So 9 
> > 


LCLK2 
LCLK2 
ENIN 


GND2 
vco1 
vco2 
L1 
LO 
CPO 
OPAMP 
14 15 16 
om © (oe ae 
“ge% 338 
TL/F/9328-2 
Order Number DP8530V 


See NS Package Number V28A 


Pin Descriptions 

1, 2, 23—PCLK2, PCLK2, PCLK1: Differential ECL and TTL 
compatible pixel clock outputs driven by the VCO (ECL out- 
puts are 10k and 100k ECL compatible). 

3—VCC3: ECL output buffer positive power supply. 
4—VCC2: ECL internal logic positive power supply. 

5, 22—GND2: ECL negative power supply. 

6, 7—VCO1, VCO2: External tank circuit connections for 
the Pierce VCO. 

8, 9, 12——L1, LO, L2: A three-bit word input used to select 
the L Counter modulus. Any modulus from 4 to 32 may be 
selected in increments of 4. LO is the least significant bit. 
10—CPO: Charge pump output. Used in conjunction with 
OPAMP to form the external loop filter. 

11—OPAMP: Op amp output of the loop. This output is 
used to control the pixel clock frequency via the varactor 
diode in the LC tank circuit. 

13, 14—XTLB, XTLC: External connections for the Pierce 
crystal oscillator. 
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15, 16—S1, SO: Two-bit word that determines the S Counter 
modulus. SO is the least significant bit. 


17—XOUT: MOS output generated from XTLB, XTLC. 


18—VCC1: TTL output buffer supply. Specified for 5V 
+10% operation. 


19——-GND1: TTL output buffer supply return. 

20—GNDO: TTL logic power supply return. 

21—VCCO: TTL logic positive power supply. Specified for 
5V +10% operation. 

24—LCLKO: TTL load clock output. Also is connected to an 
input of the phase comparator. 

25—LCLK1: Gated load clock output. 

26—ENIN: TTL compatible video enable input. A high on 
this input starts LCLK1 and LCLK2 on the next positive tran- 
sition of LCLKO. 


27, 28—LCLK2, LCLK2: ECL compatible load clock. 
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Absolute Maximum Ratings 


If Military/Aerospace specified devices are required, ECL Signals 
please contact the National Semiconductor Sales Output Current —50mA 
Office/Distributors for availability and specifications. ESD rating 1000V. 
Storage Temperature —65°Cto+150°C 
TTL Signals Se 

Inputs 7 ; 7.0V 

Outputs a 7.0V 


Recommended Operating Conditions 


Symbol A a ee ee Oe Units 
VCCO,110GNDO,1 | TTLPowerSupply | |TV 
VCC2, 3 to GND2 ECL Power Supply | Modet(Notes) | 45 | | 55 | , 

| Mode2 (Notes) | 42 | | 87 _| 
Vin | HighLevel inputVoltage, TTLinputs(Note2) | 2 | | TV 
Vit | _LowLevelinput Voltage, TTLinputs(Note2) | | | 8 
Io | High Level Output Current, TTL Outputs (Notes) |_| | 0-4 | 
| HighLevel Output Current, MOSOutpuis || |= 
lot | LowLevelOutputGurrent, TTLOutputs (Notes) | | | 
| _LowLevelOutputGurrent MOSOutputs || | 
FPCLK Pixel Glock Frequency = | TTL | |||, 


FXTL | Crystal Frequency ae MHz 


Tsu1 | Setup Time ENIN to LCLKO (Note 1) es Se ue 
TH Hold Time LCLKO to ENIN (Note 1) | o | -o | | re 
TAMBIENT 3 Operating Temp Range PeeOe i 76 °C 


Note 1: See timing waveforms for relevant signal edges (positive or negative) from which all setup and hold time measurements are made. 
Note 2: TTL inputs—ENIN, SO, $1. 

Note 3: TTL outputs—XOUT, LCLKO, ENOUT1, 2. 

Note 4: Inputs LO, 11, L2 designed to be tied to VCC(2, 3) for high level or tied to GND2 (or left open) for low level. See input schematics. 


Note 5: Mode 1: GND2 = OV; 
Mode 2: VCC2 = OV. 


DC Electrical Characteristics 


Symbol|_ Parameter_| Conditions se i _t_t Units 


Vic Input Clamp |VCC(0, 1) = 4.5V, li = —18mMA 
— 


Von TTLOutputs Io = —400 pA fvecot-2} | | 
VECO N=45ViMos outputs | lon=-100na | veo@.1)-o4] | | 
| lon= -400wa | voo,1y-23| | 
ECL Outputs 509 Load to Voc ~ 2V |VOC(2, 3) ~ 1138| _| Vote, 3) ~ 880| mv 
OPAMP Output lou = —1261CP0sm voo@s)—12) |__| 
a ae 
posoupe |= $NA ff gk 
eee 
VCO(2, 3) = OV [ECL Outputs __ 80 Load to Veo — 2V |VCOW2, 3) — 1998|_|VOC(2, 3) ~ 1480] mv 
GND2 = — 4.2V|OPAMP Output -1.251CPO Source _| aaa oe eer oaag 9 


I Max High Level/VCC(0, 1) = 5.5V/TTL Inputs Vin = 7V 
Input Current |GND(O, 1) = OV 
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VCC(2, 3) = OV 
GND2 = —4.2V 


VOL 


VCC(0, 1) = 4.5V 


DC Electrical Characteristics (Continued) 


Symbol | Conditions 


lH High Level VCC(0, 1) = 5.5V TTL Inputs GND(O, 1) = OV Vin = 2.7V 
| Input Current 

He Low Level VCC(0, 1) = 5.5V TTL Inputs GND(O, 1) = OV. Vin = 0.4V 
Input Current 


lo Output Drive | VCC(0,1) = 5.5V TTLOutputs  GND(0, 1) = OV Vout = 2.25V 
Current 


0€S8dd 


4 
> 


| 
no 
=) 
o 

= 
> 


| 
a) 


| 
jo) 
fh 
-{ 
ve) 
< 


lcpo | Charge Pump |VCC2=0V —GND(0, 1) = ov mA 
Current | mA 
| 
loc | Supply Current 22 


VCC(2, 3) = 5.5V, GND2 = OV 


VCC(2, 3) = OV GND2 = —5.7V 10k ECL range 
— |GND2 = —4.8V 100k ECL Range 


| 
ra ire ee ee ee 
o}o a — 
Otro 

: 

> 


mA 


— 
MN 
io) 


| 
— . 
o>) 


AC Electrical Characteristics 


Symbol Conditions 


| Min | Typ | 
we eocewey —————— 

ef i ee a 
Fwax | MaxxTLFreg | | 
Li | boikitopciKs | Cua t5pF | | 
Ta |tctkatopcixe | ck ispF | 
Tuouxr | kekototcikr | l= 1seF || 
TLouke 


Timing Diagram 
~ PCLK2 | 


T2 
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” 
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= 
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LCLK2 


LCLK2 
T1 
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LCLKO 


| 
pa! 
QoQ 
o 
x 
sS 
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Circuit Operation 


The heart of the DP8530 Video Clock Generator is a crystal 
oscillator which is used as a frequency reference to gener- 
ate several clock signals required in a video display system. 
Among the clocks generated are the ECL load and pixel 
clocks (LCLK2 and PCLK2) for high speed video shift regis- 
ter parallel load and shift operations, and TTL load clocks 
(LCLKO and LCLK1) for moving DRAM data to the video 
shift registers. The LCLK and PCLK outputs are all internally 
synchronized in order to simplify system timing. 


The LCLK and PCLK outputs are generated using a digital 
phase locked loop as shown in Figure 7. 


) 


vCx0 Ls _ SOUT 


COUNTER 


vco 
LCLKO@_) 


<——{> PCLk 


L 
COUNTER 


FIGURE 1. PLL Block Diagram 


The loop consists of the S and L counters, a phase compar- 
ator, and a voltage controlled oscillator (VCO) with the rela- 
tionship between these elements in the loop defined as: 


VCXO XL 
S 


where PCLK is the pixel clock frequency, L is the L Counter 
modulus, and S is the S Counter modulus. When the fre- 
quency of the VCO (PCLK) in the phase locked loop is sta- 
ble the inputs to the phase detector are in phase; thus the S 
Counter and L Counter outputs are identical in both phase 
and frequency. The crystal oscillator ensures that the phase 
and frequency of the S Counter output remain constant. Any 
drift, or change in frequency, of the VCO will be divided 
down and appear as a shift in phase at the L Counter out- 


TL/F/9328-4 


PCLK = 


put. The phase detector will sense this error and generate a | 
correction voltage for the VCO input which is proportional to 
the magnitude of the frequency error. This correction volt- 
age will change the VCO frequency to eliminate the error 
thus keeping the loop locked. 


The presence of the S Counter in the loop enables the 
graphics processor to operate at full speed independent of 
PCLK frequency. The video shift register’s parallel data 
width determines the L Counter modulus. An 8-bit parallel 
shift register would use an L Counter modulus of 8 so that a 
parallel load pulse occurs once every 8 pixel clocks. The L 
Counter output is used to derive the three LCLK outputs. 


The ECL LCLK2 output is used in conjunction with the 
PCLK2 output to load data into a high-speed video shift reg- _ 
ister. The PCLK2 provides the clock and LCLK2 provides 

the load signal for the shift register. 


The other two LCLK outputs (LCLKO, LCLK1) are TTL out- 
puts. They can be used to contro! a selection of different 
DRAMs/video shift register configurations as shown in the 
typical system architecture section which follows. 


The S Counter can be programmed to divide by any integer 
up to 4 and the L Counter can be programmed for any word 
width from 4 bits to 32 bits in increments of 4. Table | shows 
some of the frequencies possible using various values for 
the S-modulus with an L-modulus of 8. 


TABLE I. Partial Table of PCLK Frequencies 


26.7 MHz 


Figure 2 demonstrates the configuration and the resultant 
waveforms with a DP8530 driving a video shift register and 
a DRAM. : 


PCLK2 


LCLK2 


FIGURE 2. System Configuration and LCLK Waveforms Using DRAM 
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Loop Filter Calculations 


Several constraints need to be known in order to determine 
the loop filter components. They are the loop divide ratio 
(N), the phase detector gain (Kp), the VCO gain (Ko), the 
loop bandwidth (Wo), and the phase margin (¢). 


The constant Ky is fixed at 80 »A/rad for the DP8530. N is 
the L counter modulus for the loop. A 60° phase margin is 
recommended, however the equations allow other values to 
be used if desired. 


The oscillator gain constant Ko can be obtained from Table 
i! or obtained experimentally. This is done by driving R2, the 
resistor which normally connects the varactor to the 
OPAMP OUTPUT, with an external power supply. Set the 
supply to GND2 + 3V and note the PCLK frequency. Next, 
set the supply to GND2 + 2V and note the frequency again. 
The difference in these two frequencies (times 27r to con- 
vert to radians) is Ko. For optimum performance the desired 
PCLK frequency should be somewhere between the two fre- 
quencies measured above. This may require adjustment of 
the coil. 


Before choosing a value for Wo one fact should be pointed 
out: R2 and C3 (the coupling capacitor between the coil and 
the varactor) form a low pass filter. Thus the loop bandwidth 
must be chosen to be less than the cutoff frequency of this 
filter. We recommend a value of 100 Hz to 3 KHz (times 27r) 
for Wo. 


Having determined all these constants the following equa- 
tions are used to find the component values: 


e Ri = (1.08 N Wo)/(Kp Ko) 

© C1 = (3.46 Kp Ko)/(N Wo?) 

¢ C2 = (0.27 Kp Ko)/(N Wo?) 

To use a phase margin of other than 60° use the following: 
e R1 = (N Wo/2 Kp Ko) (cosec 0 + 1) 

© C1 = (2 Kp Ko/N Wo?) (tan 0) 

© C2 = (Kp Ko/N Wo?) (sec 0-tan 0) 

For example: design a system with the following character- 
istics: 

¢ 1560 pixels per line (2000 pixels including retrace) 

© 8 bit wide video data 

¢ 10 MHz processor rate 


The PCLK frequency will be 10 MHz x 8 bits = 80 MHz. 
The components in Table Il will be used. Note that Ko is 
19 Mrad/Volt. Because it is an 8-bit wide system the L coun- 
ter modulus must be 8. By choosing Wo to be 2800 Hz 
(times 27r) the equations give: 


e R1 = 1000 
e C1 = 2.0 pF 
e C2 = 0.17 pF (0.2 wF may be used) 


TABLE II 
Recommended VCO Components 


27 ka 


1 . 
F = 
OO Oa VLCToT 


GND2 


Frequency L 
(MHz) pH 


Part# pF Motorola # Mrad/volt 
aa | EsozHNs-6o00026 | 56 | 30 | mvzoo | 16 
| 80 | _ 017 | Esoains-aoo002e | 30 | 30 | mvzoo | 19 
| 120 | 07 | es02HNS-2000022 | 39 _| 


a | 
Cor = ——— 


—, + —_—_—_—_—_———— 
VARACTOR CA  CvaRACTOR 
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Input Schematics 


GND2 


TL/F/9328-9 | GND1 | | GNDO | | 
FIGURE 3. VCO Inputs = TL/F/9328-10 
| ~~ FIGURE 4. TTL Inputs 


- VCCO 
vcc2 


GND2 
TL/F/9328-11 


FIGURE 5. L Inputs 


GND1 GNDO 


TL/F/9328-12 
FIGURE 6. S Inputs 


TL/F/0328-12 
FIGURE 7. Typical ESD Circuit 
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Oo 
. U 
Output Schematics aA % 
vec? S 
260 
OPAMP 
GND2 
TL/F/9328-14 pee 
FIGURE 8. Op Amp Output 
GND1 


TL/F/9328-15 
FIGURE 9. TTL Outputs 


CPO 


VCC3 


10ka 


GND2 
FIGURE 10. ECL Outputs 


TL/F/9328—16 


GND2 
TL/F/9328-17 
FIGURE 11. Charge Pump Output/Op Amp Input 


Single Power Supply Operation Dual Power Supply Operation 


L +4,5V 
TO 
+5,.5V 
REGULATOR veco vCci vCC2 VvCC3 


veco vcci vCC2 vCC3 


#4.5V 


TO DP8530 DP8530 
+5.5V 
REGULATOR GNDO GND1 GND2 GND2 GNDO _GNDi GNO2_ GND2 


=5.7V 
TO 
=4.2V 
REGULATOR 


TL/F/9328-6 
TL/F/9328-7 
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Typical Application 


10 MHz coo 


39 pF 


L 


L 


L = TTL Logic Zero (GNDO, 1) 
H = TTL Logic One (VTTLO, 1) 


S Counter Inputs 


c2 27 ka 


500 pF 


DP8530 0.07 wH 


MV209 


vco 
si LO Li = L2 


39 pF 
68 pF 
VCCO, 1 


4 __| __Gndo, 1} vec2, 3 
i) 
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PCLK = 80 MHz 
= +16 
Split Supply Mode 


TABLE Il. Counter Modulus Tables 
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PRELIMINARY 


-DP850EB Raster Graphics Processor 
Four Plane Evaluation System 


640 x 480 resolution stand-alone PC 

board for easy evaluation and a large 

breadboard area for development 

m DP8500 Raster Graphics Processor 
featured in a basic 20 MHz four-plane 
architecture. On-board connectors for 
the power supply, multisyne monitor, 
and 25-pin serial port cable 

m Board design incorporates 16V8 GALs 

for reduced power consumption 


TL/F/10431—-1 


DOS-based software includes an 
absolute assembler, symbolic debugger, 
a software simulator, and sample source 
code programs ie 
Technical documentation includes | 
schematics and programmable logic 
equations a 

Host platform is a DOS AT/XT with an 
RS232 serial port and a multisync 
monitor 


Product Overview 


The DP850EB is designed to provide a simple, effi- 
cient means of evaluating the Advanced Graphics 
Chip Set architecture. Provided with complete techni- 
cal documentation, its own power supply, and soft- 
ware tools, the board connects to the host PC com- 


puter by means of an RS232 interface. This interface 
allows programs to be downloaded to the graphics 
board where they can also be debugged with the full- 
featured serial-interface debugger. 
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DP850DB8 Raster Graphics Processor 
_ Development System — 


DP850DB8 


m Utilizes National’s 20MHzAGCS —_«y 
architecture = |= | 

m Supports video rates up to125MHz sy 

m Supports 256 colors out of a palette of 
16 million = . 

m 256 kb of program and font memory 

m 128 kb of socketed “OTP EPROM” 
memory for resident firmware 


General Description 


The DP850DB8 is an-8-plane AT-bus add-in graphics 
card suitable for supporting extensive software devel- 
opment projects based on the AGCS architecture. 


PRELIMINARY 
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m 2 mb of frame buffer memory — 
m On-board RS232 serial port 


m Complete technical documentation, 


_ including schematics and programmable 
-logicequations 

m Third party software support (ordered 
separately) for DGIS, GKS, CGl, 
X-Windows, and Microsoft Windows 
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DP8500 
Raster Graphics Processor Software Tools 


A complete line of development and applications sup- 
port software is provided for the DP8500 Raster 
Graphics Processor. The standard release for all of 
these products is on 51%,” floppy diskettes for hard- 
disk DOS systems containing 640 kBytes of memory. 


Certain UNIX® environments are also supported 
through special order, in particular GENIX™ V.3, 
DECT™ Ultrix™’ V2.0-1, and Sun UNIX 4.2 Release 
3.2. Special source code licenses are available for 
customers who desire to port the software to non-sup- 
ported environments. 


DP85SW1001M5: RGPT™ Software Development 
Package 

(DP85SW1001SP: UNIX or source code special or- 
ders) 
The Software Development 


Package Contains: 

® Macro Assembler 

e Software Simulator 

e Linker 

e Librarian 

@ Object-to-HEX file converter 


RGP C Compiler 


UNIX or source code special or- 
ders) 


(Note: The RGP C Compiler gen- 
erates Macro Assembler source 
code as its output. The RGP 
Software Development Package 
is required in addition to the C 
compiler for a complete C devel- 
opment environment.) 


DP85SW1002M5: 
(DP85SW1002SP: 


DP85SW1003M5: 
(DP85SW1003SP: 


RGP Symbolic Debugger 

UNIX or source code special or- 
ders) 

For the DP850EB or DP850DB8 
boards. Configurable to other 
hardware environments. 


DP85SW2002M5: RGP Graphics Kernel 


The RGP Graphics Kernel pro- 
vides the complete source code 
for the implementation of a real- 
time, multi-tasking, device-inde- 
pendent graphics interface for 
the RGP. 


(Note: The Graphics Kernel is 

written in RGP Macro Assembier 

source code. The RGP Software 

Development Package is re- 

quired to assemble the code into 

executable format.) 

RGP Software Utilities 

The RGP Software Utilities pro- 

vide source code examples of 

many common graphics-related 

functions such as: 

¢ RGP initialization 

e Mathematical operations 

© Generation of curves 

© Sub-dividing complex polygons 
into convex polygons. 

(Note: The Software Utilities are 

written in Macro Assembler 

source code. The RGP Software 

Development Package is_ re- 

quired to assemble the code into 

executable format.) 

Third-Party Software Support: 

Extensive third-party software support is available for 

the RGP, including DGIS, GCI, GKS, X-Windows, and 

Microsoft Windows. Contact your local sales repre- 

sentative for detailed information. 
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An Architectural Solution 
For High Performance 
Graphics 


INTRODUCTION 


Computer graphics is one of today’s fastest growing market 
segments. The fundamental reason for this dramatic growth 
relates directly to the productivity improvement that a graph- 
ic representation can give to nearly all problems in the engi- 
neering, scientific, business, and consumer market seg- 
ments. Nothing does. more to ease the “man to computer” 
interface data interpretation and presentation problems 
than a graphics display and efficient application software. 


Penetration of computer graphics into all market segments 
has grown especially fast in the past few years since it has 
become more affordable. This is due to the decreasing 
costs of high quality CRT displays and significant reductions 
in the cost of Dynamic RAM. In the near future, high per- 
formance graphics support hardware will be more available 
in the form of a number of VLSI integrated circuits, further 
decreasing the costs of graphics hardware. 


Now that more people have a good understanding of the 
importance of computer graphics to the productivity of their 
job, the demand for higher resolution, faster graphics will 
increase significantly. This will be especially true in the engi- 
neering, scientific, and business communities. The current 
challenge for this generation of graphics hardware is clearly 
in the area of affordable, high resolution, fast computer 
graphics. Attaining these goals will not only do much to ex- 
pand the useage of computer graphics but also expand ap- 
plication areas. 


GRAPHICS ARCHITECTURE 


It is not the purpose of this paper to discuss the varied 
aspects of software and hardware computer graphics archi- 
tectures. What will be addressed are the problems of hard- 
ware architectures in the area of list driven display genera- 
tion, manipulation, and screen refresh. Current mono- 
chrome graphics hardware architectures generally do not 
satisfy the current demands of high performance computer 
graphics which include display resolutions of 1000 pixels by 
1000 pixels and above, very high speed screen manipula- 
tions, and color. 

To reinforce the growth of computer graphics, hardware ar- 
chitectures must be developed which will support the rapid 
growth in display resolution which will occur in the next few 
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years. The architectures must allow a high degree of appili- 
cation independence yet not require significant changes in 
the overhead software interface. The hardware architecture 
must be expandable, again avoiding significant software 
changes. Finally, the architecture must be implemented in 
VLSI for reasons of cost and speed. 


This paper will describe a graphics hardware architecture 
that meets these criteria. 


KEY PERFORMANCE ISSUES 


Figure 7 is a block representation of the logical hardware 
and software partitioning associated with a typical graphics 
system. The left block represents the application interface 
and image creation segments. Data must be translated from 
the database found on disk (or received via communications 
networks) to a format which will be meaningful in the CRT 
display. At the application level this may involve database 
interpretation and translation to and from graphics stan- 
dards such as GKS and CORE. But it also involves the final 
translation from what is described as the “world space” rep- 
resentation to the “normalized’’ coordinate space associat- 
ed with the hardware of the particular graphics system in 
use. This is done with the Main CPU in software or through 
hardware acceleration via usage of a number of VLSI inte- 
grated circuits on the market. 


Once this translation is completed the transfer to the actual 
“display” coordinates is usually accomplished through an 
instruction list given to a graphics processor linked directly 
to the CRT frame buffer. It is the function of this graphics 
processor to offload the main CPU in the creation and ma- 
nipulation of the massive amounts of data in the actual bit 
map in the frame buffer. Without this block, any large bit 
map manipulation would cripple the main CPU in its main 
application chores. 


The final blocks represent the portion of the graphics sys- | 


tem which handles the lower level image movements and 
manipulations as well as the screen refresh function. High 
speed hardware implementations of these blocks with an 
architecture that links closely with the graphics processor is 
the only way performance can be maintained as display res- 
olution increases along with the increasing number of mem- 
ory planes for color. 


APPLICATION CO=PROCESSOR 
INTERFACE ——> GRAPHICS VIDEO 
GKS, CORE, VDI INTERFACE MANIPULATION 


APPLICATION § 


TRANSLATION } 


GRAPHICS | 
f INSTRUCTION | 
| PROCESSOR 


GRAPHICS | 
DATA 
| MOVEMENT 


SCREEN 
REFRESH 
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FIGURE 1 


Reprint by permission of Wescon/85® 
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We will focus on the performance issues of the last two 
major blocks of Figure 7 and factors that will influence. 


© List Driven Display Generation 
e Graphic Data Manipulation and Movement 
e Screen Refresh 


The technical problems of providing cost effective architec- 
tural solutions with present hardware (given the current and 
future performance requirements) make these functions 
ideal candidates for VLSI integrated circuits. 


THE GRAPHICS FRAME BUFFER— 


_ SCREEN REFRESH ARCHITECTURE 


To focus on these performance issues requires a good un- 
derstanding of the tradeoffs associated with the graphics 
frame buffer. Figure 2 is a simplified diagram of a typical 
graphics workstation. The graphics controller is shown 
closely linked to the memory associated with the CRT 
screen display (called the CRT refresh memory or the frame 
buffer) and the main CPU with its associated graphics ge- 


MAIN MAIN GRAPHICS 
CPU(s) MEMORY keine 
PROCESSING 


NETWORK | 
INTERFACE 


DISK/ TAPE 


INTERFACE 


is ETHERNET 


CONTROLLER 


ometry processing hardware and software. This figure 
shows the popular configuration in which a special proces- 
sor is allocated to handle graphics functionality thus off 
loading the main CPU with respect to fundamental graphics 
operations. Those operations are list driven based on a low- 
er level graphics software language. We will talk later about 
advantages of this and some specific implementations. 


The image that is stored in the frame buffer is actually 
stored conceptually in three dimensions as shown by the 
cube represented in Figure 3. Each pixel on the screen can 
be mapped to multiple pixels (in the case of color) contained 
within this cube. Access to the data stored in this cube is 
necessary for both screen refresh, update, and data manip- 
ulation. Ideally, the amount of time available for manipula- 
tion should greatly exceed the time required for screen re- 
fresh, thus enabling very quick display update and image 
movement. | 


The fundamental graphics tradeoffs involve the need to 
maximize CPU or graphics controller access to the frame 
buffer while maintaining regular (required) CRT refresh. As 
display resolution increases and color becomes more im- 
portant these issues become significant factors in the de- 
sign and cost of the graphics sub-system. 
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FIGURE 2. Typical Graphics Workstation 
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FIGURE 3. Graphics Frame Buffer 
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FIGURE 4. The Graphics Video Loop 


Current video rates for medium resolution graphics systems 
easily exceed 50 MHz. High resolution systems attain video 
rates on the order of 100 to 125 MHz with clear trends to 
rates in excess of 125 MHz in the near future. Given these 
clock rates, screen refresh has become a significant prob- 
lem in the design of the graphics sub-system. 

The high video rates have forced a graphics memory organi- 
zation for refresh as shown in Figure 4. Since the size of the 
frame buffer is large, the desired memory component be- 


cause of cost and space is the DRAM. However, the DRAM ~ 


does not have access speeds suitable for current high reso- 
lution displays. As a result, the screen refresh process is 
implemented through a parallel to serial conversion in the 
“video loop”. To refresh the screen, the graphics controller 
presents a word address to the frame buffer, the resulting 
data word (usually 16 pixels) is then converted into a serial 
video stream via an external (usually TTL to ECL) shift regis- 
ter under the timing and control of the graphics refresh 
hardware. Figure 4 shows the implementation for a single 
plane system, multiple plane systems would require the 
same number of 16 Bit refresh pixel words and parallel to 
serial video shift registers as the number of planes. 


If the video rate is 100 MHz, this parallel to serial buffering 
reduces the need for a parailel word from the frame buffer 
to a clock rate of 6.25 MHz if the word is 16 bits wide. This is 
still an access time of 160 ns. Given this, it is clear that with 
conventional time multiplexed DRAM design, the only time 
left for data manipulation and screen update will be during 
horizontal and vertical blanking when screen refresh is dis- 
abled. With the demands of current graphics end applica- 
tions, this is clearly not enough! 


A number of frame buffer architectures have been devel- 
oped to get around this problem. These include double 
frame buffering, dual porting, and making maximum use of 
page mode access. The most popular solution to this prob- 
lem is the one provided via a new type of DRAM called the 
Video DRAM shown in Figure 5. The video DRAM is a con- 


3-5 


256 COLUMNS 


—R/W 
@ ADDRESS 
956) CONVENTIONAL RAS 
ARRAY 
DATA OUT 
DATA IN 


256=BIT S/R LSERIAL OUT 
| cLock 


TL/F/8769-5 
FIGURE 5. The Video DRAM 


ventional DRAM with a 256 bit shift register on board. This 
shift register is connected such that a complete row of 
memory cells can be loaded into the shift register at one 
time. The shift register has its own output which can be 
controlled with separate input lines to the video DRAM. If 
you configure the Video DRAM as shown in Figure 6, you 
can see we have implemented a dual ported memory. The 
left side is configured as a typical DRAM interface. The right 
side is configured for screen refresh with the parallel word 
being generated from the shift registers of the Video DRAM 
(this parallel configuration is still needed since the Video 
DRAM shift registers are not able to clock at the full video 
rate in high performance systems). 


The advantage that this configuration provides is nearly full 
time access for screen update and manipulation since only 
one load to the shift register is generally needed per scan 
line. While the pre-loaded video data is clocked out of the 
shift register, random access can occur in the frame buffer 
by either the graphics controller or the main CPU. 


Many configurations of the Video DRAM are becoming 
available with multiple vendors committing to the architec- 
ture already in place. It will clearly become the standard 
component for building graphics frame buffers. Any new 
graphics hardware architecture must be designed to opti- 
mize the use of this type of memory component. 
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THE GRAPHICS FRAME BUFFER— 
UPDATE/MANIPULATION ARCHITECTURE 


We have discussed the preferred frame buffer architecture 
with respect to screen refresh. Now we will evaluate the 
tradeoffs associated with the update and manipulation side. 
The performance issues associated with the graphics proc- 
essor or main CPU update or manipulation of the image in 
the frame buffer have a major influence on the data and 
address configuration of the frame buffer. There are gener- 
ally three such configurations—pixel, plane, and mixed. 


The pixel architecture is shown in Figure 7. This architecture 
is best described as one in which the frame buffer data is 
manipulated one pixel at a time. For multiple planes, the 
address to the frame buffer generates a data word which is 
composed of pixels at the same location across multiple 
planes—thus the term “pixel depth”. This is the architecture 
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FIGURE 7. Pixel Architecture 
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often found in image processing and solids modeling appli- 
cations where the value of each pixel is very computation 
intensive due to color value or shading variations. These 
applications typically require 16 to 32 memory planes. 


In the plane architecture, Figure 8, the frame buffer data is 
manipulated one word (usually 16 bits) at a time within each 
plane. To change one bit, 15 other bits must be carried 
along. Also, since the word boundary of the 16 bits exists, a 
barrel shifter is needed if image placement and movement 
accuracy is needed down to the actual pixel level. Despite 
these disadvantages, in the engineering and business appli- 
cation area, the plane architecture is the most popular since 
these applications are less pixel computational intensive but 
more data creation and image movement intensive. This ar- 
chitecture is lower cost and brings with it higher perform- 
ance when large bit maps must be manipulated. ; 
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FIGURE 8. Plane Architecture 


Where the application has need for both types of architec- 
tures, the mixed architecture shown in Figure 9 is imple- 
mented. Here, access to the frame buffer can be either at 
word width or pixel depth, thus providing the best of both 
worlds. In the past, this architecture has been implemented 
only in the more expensive of workstations due to the over- 
head hardware costs. But, the applications of workstations 
that need high speed update along with computation inten- 
sive displays is increasing. The time has come for a graph- 
ics hardware architecture that can efficiently merge these 
two architectures. 
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FIGURE 9. Plane and Pixel Architecture 


WORD BOUNDARIES AND THE BARREL SHIFTER 


In the engineering and business community, the most popu- 
lar frame buffer architecture is the plane architecture. But as 
the quality of these displays increase, restrictions associat- 
ed with the plane architecture word boundary constraints 
have limited the flexibility and performance of these sys- 
tems. To solve these problems, some of the designs have 
switched to the pixel architecture. With the “correct” archi- 
tectural solution this switch would have not been necessary. 


Figure 10 demonstrates the word boundary problem via a 4 
bit word example. This figure demonstrates a “word bound- 
ary aligned” translation. The relative pixel locations of the 4 
bit word are the same at the destination image as they are in 
the source image. Thus the pixels maintain the same align- 
ment to the word boundary. This is a simple transfer where 
manipulation of the pixel positions in the source word need 
not be modified to transfer it to the destination. 
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FIGURE 10. Plane Oriented Transiation 
(Word Boundary Aligned) 


In Figure 77, the source word pixel alignment does not 
agree with the desired pixel destination alignment. In this 
transfer, the source pixel map is shifted one pixel to the right 
in the transfer. This type of control is not available if the 
frame buffer is addressed by words. If the frame buffer is 
addressed by pixel, then the price paid for overhead access 
to do this simple manipulation is significant. A better solu- 
tion to this problem is to employ an additional hardware 
function called a barrel shifter and maintain word boundary 
addressing. 
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FIGURE 11. Plane Oriented Translation (Non-Word 
Boundary Aligned) 


A simplified diagram of a barrel shifter is shown in Figure 12. 
in order to maintain consistency across word boundaries on 
a shift like the example in Figure 77, two source words must 
be read to create a new shifted destination word. The barrel 
shifter solves the fundamental restrictions of the plane ar- 
chitecture with respect to exact pixel manipulations. But, if 
these manipulations occur within the main CPU or for that 
matter in the main graphics processor a new performance 
bottle neck may result in multiple plane color systems. The 
ultimate performance of such a system is associated with 
the speed of the barrel shifter and its location. Ideally, for 
maximum performance, you would want a barrel shifter for 
each plane. 
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FIGURE 12. The Barrel Shifter 


BITBLT 


Due to the growth of display resolution and the resulting 
demand for movement of massive amounts of data, the 
need for a lower level graphics operator that well describes 
these data movements and manipulations has increased. 
The most popular operator or function is BITBLT (for BIT 
Boundary BLock Transfer) originally developed at Xerox 
PARC (Palo Alto Research Center). This function is also 
called Raster Op (short for Raster Operator). 
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Fundamentally, BITBLT is a logical operator which simply 
describes the manipulation of rectangular bit maps of any 
size. For this reason it should be viewed as a lower level 
language for graphics bit maps. Figure 73 is a conceptual 
drawing of how BITBLT works. 


The block on the right is a source bit map area which can be 
outside or within the view area of the frame buffer. On the 
left is the destination area generally, though not necessarily, 
within the view area of the frame buffer. If the graphics hard- 
ware is implemented with BITBLT in mind, a single simple 
setup from the main CPU (via an instruction list) can cause 
the graphics controller to move massive amounts of data. 
The only information needed is the absolute pixel address 
of the source and its width and height in pixels, the destina- 
tion pixel address, and the clipping rectangle pixel address 
with its associated pixel width and height. 


Resulting from this single setup, any size bit map can be 
moved without intervention from the main CPU. Additionally, 
if a barrel shifter is integrated within this operation, the 
BITBLT is not bounded by word boundary constraints. Final- 
ly, if multiple BITBLT processors and barrel shifters are 
used, for example one set per plane, transfers within all 
planes can be done in parallel. 
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FIGURE 13 


If interaction is desired from the source image with the desti- 
nation area, logical operations may be implemented within 
the BITBLT operation, such as AND, OR, and XOR. To do 
this, the source data is first read and barrel shifted to align 
with the destination word. The destination word is then read 
and the desired operation performed, creating a new desti- 
nation word. This word is then transferred to the destination. 
This is described by the following equation—- 


SOURCE OP DESTINATION —> DESTINATION 


Where OP can be a logical operator. such as AND, OR, 
XOR, etc. 
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BITBLT is particularly well suited for applications where 
there is a high mixture of graphics, text, and windowing. 
Pop-up menu’s, icons, accelerated filling, fat line drawing, 
and high speed text transfer are all functions where BITBLT 
provides accelerated solutions. a 


Clearly, for any architecture to be a good match to high 
resolution displays and fast screen update, the main CPU 
should not be involved in the simplest of image movement 
chores. The main advantage of BITBLT functionality comes 
in off loading the main CPU with the detail of the bit map 
operations and at the same time adding a powerful function- 
al operator. 


PREVIOUS SOLUTIONS 


Historically, graphics VLSI controllers have not well ad- 
dressed the high resolution color applications of modern bit 
mapped graphics terminals. As a consequence, much of the 
graphics hardware was implemented with bipolar bit slice 
processors and random logic, much of which was Schottky 
TTL or ECL. The result was a costly high power solution, 
difficult to map into future high resolution multiple plane ap- 
plications. 


A new generation of graphics VLSI integrated circuits will be 
introduced during the next year, from a variety of semicon- 
ductor vendors. Many of these 1.C.’s have been developed 
to address the performance problems associated with high 
performance bit mapped displays. In nearly all cases, to 
make these controllers effective, a graphics hardware archi- 
tecture had to be selected. In most cases this has restricted 
the performance and applicability of these devices across 
the spectrum of graphics applications.. 


Typically, most of these architectures suffer from inflexibility 
in the processor to memory plane interface. Figure 74 is an 
example of this problem. Some of these architectures are 
designed to directly support 1, 4, or 8 planes of memory. 
Transition to more planes requires additional processors. 
The result is an associated cost increase in hardware usual- 
ly coupled with a degradation of performance due to syn- 
chronization problems between processors. 


Although performance may be good within the fundamental 
bounds of a single processor, the rules significantly change 
when the transition is made to multiple processors. Nearly 
without exception, this new generation of VLSI components 
has the main controller or processor intimately involved with 
both frame buffer addressing and data manipulation. This is 
where the main limitation exists; when memory plane ex- 
pansion occurs, the graphics processor again becomes the 
bottle neck to graphics system performance. It matters little 
whether this is a BITBLT based architecture with internal 
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FIGURE 14. Current Graphics Chips Color Application 


barrel shifters, the common data and address graphics 
function is the limitation to expansion and performance. 


Many of these new chips begin to address the notion of 
parallelism. Operations can occur in different memory 
planes at the same time. However, due to pin and architec- 
ture limitations, expansion to more planes than supported 
by a single processor significantly degrades performance. 


OPTIMUM ARCHITECTURE 


The only practical way to solve these problems and still 
adequately address all the performance issues we have dis- 
cussed here is to implement an architecture as shown in 
Figure 15. 


In this diagram a single graphics processor assumes the 
responsibility of address and timing associated with the 
graphics frame buffer while maintaining the classical ad- 
dress and data interface with the main CPU and processing 
functionality with its local program storage memory. 


With such a processor, based on a BITBLT architecture, 
frame buffer address operations are its responsibility while 
actual data manipulation is the responsibility of the slave 
BITBLT data manipulation functions. 


The Slave Controller (data manipulator) is a data handling 
chip which receives all control from the single main graphics 
processor. It is responsible for masking, barrel shifting, and 
BITBLT operations associated with its own memory plane. 


A separate control bus from the graphics processor passes 
all control and setup information to the slave manipulators 
in parallel with other control information via the data bus. 
Once this initial information is set-up, the main graphics 
processor is no longer involved in graphics data manipula- 
tion while the graphics function is implemented. The slave 
manipulators can be configured via the control and data bus 
for the exact destination left and right masking, BITBLT op- 
eration, and amount of barrel shift. 


Additionally, operations can occur in parallel with all slave 
manipulators working within their own plane. When plane to 
plane transfers are required, one slave manipulator acts as 
the source and any number or combination of slave manipu- 
lators act as the destination. Here again, any BITBLT opera- 
tion, either within a plane or plane to plane is fully set-up via 
a single graphics processor independent of the number of 
planes used. . 


Most importantly, performance is independent of the num- 
ber of planes. Operations for 32 planes of BITBLT take the 
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same short time as if only one plane were used since the 
architecture is the same. Plane to plane transfers have the 
same speed whether you are transferring the image to one 
plane or 16 planes! 


In summary, in this architecture, the main graphics proces- 
sor is concerned with graphics setup and addressing to the 
frame buffer during the graphics operation. With a BITBLT 
architecture, large data movements, character/text transfer, 
and line drawing and filling all exhibit high performance. 
Since the processor function occurs only once in the sys- 
tem, the higher cost associated with such processors is a 
constant and independent of the number of planes. 


The more cost effective Slave manipulators are concerned 
with the actual data manipulation via control from the main 
graphics processor. They are local to each memory plane 
frame buffer and fully synchronized by the main processor 
for operations within the plane or for plane to plane trans- 
fers. 


This architecture provides one consistent hardware inter- 
face independent of the number of memory planes utilized. 
The hardware interface is the same for one plane or 32. 
This lends itself to a level of software consistency not found 
in any other graphics architecture. 


Finally, the level of parallelism in this architecture creates a 
consistent growth path for performance. As the number of 
frame buffer memory planes is increased, the performance 
does not degrade but remains equal to a single plane archi- 
tecture. In fact, with proper implementation in VLSI compo- 
nents, the true speed limitation of such an architecture is 
limited only by the speed of the DRAM used for the frame 
buffer. 


VLSI IMPLEMENTATION 


National Semiconductor Corporation is developing a graph- 
ics chip set to match this graphics architecture. The first four 
chips in this chip set will be introduced during 1986 and are 
shown in Figure 16 for an example of 3 memory planes. The 
chip set is fully expandable to any number of memory 
planes. 


The RGP (Raster Graphics Processor) is the main graphics 
processor, the BPU (BITBLT Processing Unit) the slave 
data manipulator. The VCG (Video Clock Generator) is a 
timing and control generator for the graphics system with 
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capability of generating clocks in excess of 150 MHz. The: 


VSR (Video Shift Register) acts as a parallel to serial con- 
verter capable of clock rates in excess of 150 MHz. The 
chip set has been designed to support all types of RAM 
components that may be used in the frame buffer, including 
Video DRAM. 


The National Semiconductor architecture is very well suited 
for plane oriented architectures, but can work equally well in 
pixel oriented architectures. Additionally, it is the first chip 
set which will provide a complete solution for mixed mode 
applications. _ 
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Interfacing the DP8500 
Raster Graphics Processor 


1.0 INTRODUCTION 


A graphics subsystem using National Semiconductor's Ad- 
vanced Graphics Chip Set (AGCS) requires some interface 
circuitry between the DP8500 Raster Graphics Processor 
(RGP) and other devices on the bus. Specifically these de- 
vices may include: 


e the Frame Buffers (memory) in each plane, 
e the Bitblt Processing Unit (BPU) in each plane, 
© a data transceiver in each plane, 


e other types of memory used in the system, such as 
EPROM for basic functions and startup, 


® memory-mapped |I/O devices, including latches for con- 
trol of color. 


In general the interface circuitry required between the RGP 
and other devices will consist of two main blocks: 


e the Bus State Machine (BSM), which generates memory 
strobes as required by the particular type of memory be- 
ing accessed, and transceiver control signals, and 

e the Video Plane Controller (VPC), which enables only se- 


lected planes of memory, and ensures correct data flow 
between memory planes. 
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Additionally an Address Decoder block produces various 
enable signals according to the address being generated by 
the RGP. 


The RGP’s bus interface has been deliberately kept gener- 
al-purpose to allow flexibility of memory types; this means 
that the system designer must tailor the interface to suit his 
requirements. The purpose of this application note is to de- 
scribe some of the issues and to suggest some solutions. 


Memory Architecture and Graphics Operations 


The AGCS has an open architecture which permits both 
planar and pixel operations. The memory is organized in 
planes (see Figure 7 ), in which the “color value” of a partic- 
ular pixel is the parallel combination of the corresponding 
bits in separate words of memory. This requires that the 
graphics controller be able to access each plane separately; 
but by dividing the graphics rendering functions between the 
RGP (address generation) and a BPU on each plane (data 
manipulation), the AGCS allows parallel graphics process- 
ing. Thus the time required to render a particular graphics 
object is independent of the number of bits per pixel and the 
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data processing bandwidth is matched to the memory band- 
width with uniform expandability. 


Contrast this with the more common, but less powerful, 
packed-pixel architecture. This requires only a single graph- 
ics controller, with access to just one block of memory. 
Within the memory each word holds the “color value” of a 
number of pixels (for a 4 plane system, i.e., 4 bits per pixel, a 
16-bit memory word would hold the color values of 4 pixels). 
With this architecture therefore, the time required to render 
a graphics object does depend on the number of bits per 
pixel—the system gets slower as the number of bits per 
pixel increases. Thus the performance is limited by the data 
processing bandwidth. 


In addition to being able to perform planar operations (such 
as Character Drawing and Bitbit) at high speed, the AGCS 
can also execute pixel operations (such as Read and Draw 
Point) efficiently. This is achieved by means of the Pixel Port 
in each BPU, which allows the RGP, or some other proces- 
sor, to access data representing a single pixel. 


It is important to understand that the RGP is a processor—it 
fetches instructions from memory and executes them. Spe- 
cific graphics instructions are “hardwired” rather than mi- 
crocoded, thus providing the highest performance for ren- 
dering operations. During these operations the RGP does 
not process the data in each plane—this function is handled 
by the BPU. Thus the RGP does not need to have any 
“knowledge” of the number of planes attached to it. 
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This leads to the concept of addressing duality: during in- 
struction fetches and operand reads and writes, the RGP is 
accessing just one word in memory, and data passes to or 
from the RGP—the “linear’”’ type of access. During “‘draw- 
ing” operations, however, the RGP is providing only address 
and control information to memory. Drawing operations in- 
clude Line Drawing, Bitblt, and higher level operations which 
use these, such as 1 rendering, and also Screen Re- 
fresh. ; 


Thus the system designer needs to build an interface which 
can recognize these two types of access, and act on memo- 
ry accordingly. Furthermore, for drawing operations there 
must exist some mechanism in the interface circuitry to al- 
low the RGP’s programmer to select just one or more 
planes which will take part in the operation. This allows attri- 
butes such as color and intensity of the graphics objects to 
be controlled. 


Figure 2 shows the block diagram of the interface logic. 


The Pixel Port 


The BPU has a multiplexer on-chip, which, in point-wise 
drawing mode (Line Drawing) where one pixel is modified at 
a time, selects one of the 16 bits of the destination word and 
latches it into a single-bit bidirectional port. Conversely a 
single bit can be latched into this port prior to drawing: the 
bit is replicated 16 times and presented to the BPU’s Logic 


‘Unit along with the Destination word. Thus the selected bit 


of the Destination word can be modified by the latched bit in 
conjunction with the programmed logic function. 
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This mechanism is used in the RGP’s Read Point and Draw 
Point instructions, which allow the RGP to read or write the 
“color value” of a single pixel. In addition to uses in color 
arithmetic this mechanism is useful in allowing pixel archi- 
tecture data bases to be easily and rapidly translated to the 
AGCS’s planar architecture. 


Local and Interplane Bitbit 


For Bitblt and derivative operations it is common for the 
Source and Destination rectangles to lie in the same plane 
(Local Bitbit). But it may be a requirement to have a source 
rectangle in one plane with the Destination lying in multiple 
planes (iInterplane Bitbit). This would commonly be used for 
text rendering where the font bitmap is stored only once in 
one plane—perhaps in low-cost DRAM rather than Video 
DRAM, but will be required to be rendered in different colors 
by transfer to various Destination planes. 


2.0 EXAMPLE SYSTEM 


To illustrate the design of an RGP interface a typical small 
system will be used. The actual example is National's 
DP850EB, an evaluation board for the RGP. This is a stand- 
alone 4-plane system with 1M pixels per plane, which has 
basic system firmware in PROM, and a UART for communi- 
cations with a Host computer. The main components of the 
system are: 


e One DP8500 RGP 


@ Four planes, each containing one DP8511 BPU, four 64k 
x 4 Video DRAMs (VRAMs), a transceiver, and a shift 
register for parallel to serial conversion 


e One NS16450 UART 
e Two 2k x 8 87SR193 PROMs for resident firmware. 


Memory Map 


The RGP will start to execute instructions from address 0 
after reset, thus the PROM space must begin at 0. The four 
planes are allocated separate, but congruent, memory 
spaces. This simplifies the system design (but systems 
which need very large amounts of memory for each plane 
might need to map each plane to the same space). These 
considerations, together with the desire for simple memory 
space decoding, result in the following memory map: 


PROM 000000-3FFFFF 
PLANE 0 400000-40FFFF 
PLANE 1 410000-41FFFF 
PLANE 2 420000-42FFFF 
PLANE 3 430000~-43FFFF 
BPU CRE 800001 
BPU PDLE 800002 
BPU POE 800004 
BPUO FSE A00001 
BPU1 FSE A00002 
BPU2 FSE A00004 
BPU3 FSE A00008 


UART 
VPC 


C00000-DFFFFF 
E00000-FFFFFF 


Space Address Range Total 
(Hex) in Space Words 
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BPU Registers 


The BPU Control registers must be memory mapped so 
that, when the RGP automatically loads them (in one ac- 
cess) prior to drawing, the interface logic can generate the 
required BPU control signal (CRE). The BPU Base Address 
(800001 hex) needs to be loaded into the RGP upon Initiali- 
zation. 


Similarly the BPU Function Select registers need to be 
memory mapped so that the user can load them individually. 
During the RGP’s linear access to each register the inter- 
face logic needs to generate the required BPU control sig- 
nal (FSE). 


Other |/0 Devices 


The Bus State Machine and Address Decoder produce an 
Enable ) signal (ENUART), which, together with Read (R) and 
Write (WR) strobes, permit RGP access to the UART regis- 
ters. 


Memory 


The system uses VRAM in page mode for drawing accesses 
only. There is negligible increase in performance for page 
mode during other VRAM accesses. The RGP’s Page Break 
(PB) output is useful in controlling page mode accesses. 
When High at the beginning of an access (PB becomes 
valid during the second half of the RGP’s T1 cycle) PB indi- 
cates that the current access is within the same 256-word 
page as the previous access. A Low on PB indicates that 
the access is to a new “page”, or row, of memory. 


3.0 BASIC TIMING 


The example system uses Video DRAM with the following 
parameters: 


RAS Access Time: 120 ns Max 

RAS Pulse Width 120 ns Min 

RAS Precharge 

Time: 90 ns Min 

CAS Access Time: 60 ns Max 

CAS Pulse Width: 60 ns Min 

CAS Precharge 

Time: 50 ns Min (Page Mode) 
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At an RGP clock frequency of 20 MHz these parameters 
can be met by using the following: 


RAS Low for 3 Clock Periods . 

RAS High for 2 Clock Periods _ 

CAS Low for 2 Clock Periods 

CAS High for 1 Clock Period (Page Mode) 


4.0 ADDRESS DECODER 


This uses the BSM’s IOEN signal to generate the following 
“chip selects”: 


CRE BPU Control Reg Load 
PDLE BPU Pixel Port Latch Enable 
POE BPU Pixel Port Output Enable 


ENPROM PROM Output Enable 
ENUART UART I/O Enable 


LMASK Latch VPC Control Data from RGP 

FSEO — BPU Function Select Reg Load, Plane 0 
FSE1 BPU Function Select Reg Load, Plane 1 
FSE2 — BPU Function Select Reg Load, Plane 2 
FSE3  BPU Function Select Reg Load, Plane 3 


These signals are derived using straightforward combina- 
tional logic. Their timing follows the BSM’s IOEN (see next 
section). | 

5.0 BUS STATE MACHINE 

This generates the following signals: 


RDY RGP Ready Signal 

RAS RAS signal to VRAM 

R/C Row/Column Address Selector to Address Multi- 
plexer 

NCASIN Master CAS Signal to VPC 


WR Write Signal to VRAM and UART 


DT/OE Data Transfer/Output Enable to VRAM, Read 
Signalto UART 

DDIN _ Data Direction Signal to Transceiver on RGP’s 
Bus 

DBE Data Buffer Enable to Transceiver on RGP’s Bus 

DLE BPU Data Latch Enable 

DOE BPU Data Output Enable 

RFIP Refresh in Progress to VPC 

IOEN 


Linear Access Timing Strobe 


Address 


DRAW ACCESS 
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BPU 
DATA 
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INSTRUCTION FETCH 


5.1 BSM Basics 


To derive a scheme for controlling the bus accesses consid- 
er that data flow during Drawing accesses can overlap with 
the start of the next RGP cycle, as shown in Figure 3. In- 
deed, the RGP’s BPU FIFO control signals (FRD and FWR) 
are timed to use this overlap. 


This causes no problem with Local Bitblit; for Interplane 
Bitbit however, since the Source data appears on the com- 
mon bus during Source Reads, it is necessary to “cut” the 
RGP’s data path for the first part of the overlapped access 
as this could be a linear access. 


BSM TIMING 


During each cycle of the RGP the BSM will need to change 
both its state and its outputs based upon its present state 
and inputs from the RGP. 


To avoid having to use very fast logic the example system 
uses a pipelined technique for the BSM: 


e Input changes occurring during a particular RGP cycle 
cause the BSM to change state at the beginning of the 
following cycle. The state is clocked by the rising edge of 
PH1. 


e Output changes which occur as a result of this state 
change happen at the beginning of the cycle after this, 
and are clocked by the rising edge of PH1. 


Consequences of this are that an input change occurring 
during cycle “N’’ will result in an output change in cycle 
“N + 2”—something that must be remembered when look- 
ing at the BSM timing diagrams which follow. 


The one exception to this scheme is the signal NCASIN, 
which is used by the VPC to selectively generate CASO-3 
for the memory planes. NCASIN is a combinational BSM 
output, and is clocked by the rising edge of PH1 in the VPC 
after being processed by more combinational logic (see 
section 6.0). Thus CASO-3 become valid at the beginning 
of RGP cycles, just like BSM outputs. In the timing diagrams 
(Figures 7 to 14) which follow, CASn is shown rather than 
NCASIN, the ‘‘n” representing any one or more of 0 to 3. 


Figure 4 shows the structure of the BSM and the timing 
relationships between inputs, states and outputs. 
BSM Bus Requests 


There are two sources of bus access requests: the RGP 
and the Refresh Timer. The BSM contains two flip flops to 
denote a bus request from these sources: 


RGPRQ: Set by the RGP’s ALE, Reset by the BSM 
RFRQ: 


Set by the Refresh Timer, Reset by the BSM. 
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FIGURE 4. Bus State Machine Timing 


The principle of the BSM described here is that, on getting a 
bus request from one of these sources (with RGPRQ having 
the higher priority), the BSM counts through a sequence of 
states, ultimately ending in state 0, the idle state, where it 
remains until another bus request occurs. The particular se- 
quence used depends on the type of access required, and 
can be determined by decoding the RGP status lines and 
the Bus access requests: 
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5.2 BSM Linear Accesses 


The linear access is used for Instruction Fetches, Operand 
Reads and Writes, and general Input/Output. The previous 
access could have been either a Drawing access (RAS end- 
ing in the cycle after T1) or another linear access (RAS ends 
in previous T3). Both possibilities are shown in Figure 5. 


To meet the VRAM timing requirements for the system we 
need to allow for the worst case (in which the previous ac- 
cess is a Drawing access). Thus RAS for the Linear access 
can start two cycles later as in Figure 5. 


The above analysis is for a relatively simpie Bus State Ma- 
chine. A more complex design could reduce the impact (1 


extra clock cycle) of allowing for a previous Drawing access — 


by recognizing a sequence of Linear accesses. In this case, 


_ because only two cycles of RAS precharge are needed, the 


access can be started one cycle earlier. Additionally, for the 
example BSM design, the same timing is produced for all 
linear accesses regardless of the type of memory being ac- 
cessed—again, a more complex design could vary the tim- 
ing according to the type and speed of memory being ac- 
cessed. 


The complete BSM behaviour for a Linear Read access fol- 
lowed by a Linear Write access is shown in Figure 6. Once 


BSM X 0 1 2 3 
state 
RGP Wy 
state 


PH1 


RFIP 


IOEN 


Conditions for Linear Access: BS1 = L, RGPRQ = H, RFRQ = L 


the previous access has completed (i.e., the BSM has 
reached state 0) the BSM determines that an RGP bus re- 
quest is pending (RGPRQ has been set by ALE going high), 
and starts the Linear access by branching to state 1. 


In subsequent clock cycles the BSM enters states 2, 3 and 
4, and finally state 0. The RGP’s RDY input is asserted dur- 
ing state 4 so that the RGP will complete the access, with 
data becoming valid at the RGP’s inputs at the end of T3. 


Note that R/C, which switches the VRAM address lines be- 
tween Row addresses (when High) and Column addresses 
(when Low) is generated by delaying the RAS signal by half 
a clock period. 


Also, since the RGP will latch the read data from the ac- 
cessed memory or device at the end of T3 during the Read 
access and send data out during the write access, the 
RGP’s Data bus transceiver is enabled by DBE and its direc- 


_ tion controlled by DDIN. The timing strobe IOEN is pro- 
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duced, with the same timing as CASn, so that the Address 
Decoder block can produce the appropriate strobes to the 
accessed device. 

DT/OE pulses low to enable data from the accessed mem- 
ory during the Read access, and WR pulses low to write 
data to memory. 
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FIGURE 6. Linear Read and Write, Preceded by Draw Access 
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5.3 BITBLT Source Read and Destination Write 
Accesses 

The first type of Drawing access to be considered is the 
Bitblt Source Read: this occurs when the BPU FIFOs are 
being filled prior to the Destination Write or Read/Modify/ 
Write accesses during a Bitbit. 

Figure 7 shows the BSM behaviour when a Page Break oc- 
curs, i.e., page-mode memory cycles are not possible be- 
cause subsequent accesses are to different rows (or 
pages”) of memory. During state 8 the RGP’s PB signal is 
determined to be Low, so the BSM branches to state 0 to 
terminate the access (by bringing RAS High). 
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DT/OE is produced to enable the data from memory, which 
is latched into the BPUs with DLE. Note that for interplane 
Bitbits only one memory plane is read, but the data may be 
latched in several BPUs: the VPC logic controls the data 
routing. 

Figure 8 shows the BSM behaviour for page-mode memory 
accesses, 
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FIGURE 9. Bitbit Destination Write, Non-Page Mode 
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FIGURE 10. Bitbit Destination Write, Page Mode 


If, during state 8, the BSM finds that the next access is the 
other basic type of Drawing operation (Read/Modify/Write) 
and there is no Page Break then a branch will occur to the 
corresponding point in its state sequence (state 10). 


Figures 9 and 10 show the state sequences for Destination 
Write accesses; these are identical to the states for Source 
Reads but the outputs differ. In particular the DOE output is 
asserted during the RAS pulse to enable data out of the 
BPUs, and the WR output is pulsed at the end of CASn to 
write the data to the VRAM. 
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5.4 Draw Read/Modify/Write Accesses 


This type of access occurs during both Line Drawing (al- 
ways) and Bitblt (if the Bitblt Combine option is chosen, and 
for both this and the OverWrite option during modification of 
boundary Destination words). 


Figure 11 shows non-Page mode accesses. The BSM uses 
states 9 to 14. DT/OE is pulsed during the first half of the 
CASn signal to read the memory data into the BPUs, where 
it is latched with DLE. (For Bitblt accesses the RGP’s FRD 
signal simultaneously fetches the aligned Source data from 
the BPU FIFO). 
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Conditions for Draw R/M/W: BS1 = H, BSO = L, RGPRQ = H, RFRQ=L,R=L,W=L 
FIGURE 11. Draw Read/Modify/Write, Non-Page Mode 
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During the second half of CASn the BPU’s logic unit output 
data is enabled out to memory by DOE, and written to mem- 
ory by WR. 

If, during state 14, the BSM finds that the next access is the 
other basic type of Drawing operation (Source Read or Des- 
tination Write) and there is no Page Break then a branch will 
occur to the corresponding point in its state sequence (state 
6). 

Figure 12 shows the same accesses in Page mode. 
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5.5 VRAM Transfer Access (Screen Refresh) 

The RGP, if appropriately programmed, will request a bus 
access for VRAM data transfer at the beginning of each 
display scan line, or when the display row address is zero. 
During this access up to 256 words of data are transferred 
from the VRAM memory array to the VRAM shift registers, 
and from there are transferred to the display device. 
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Conditions for Draw R/M/W: BS1 = H, BSO = L, RGPRQ = H, RFRQ=L,R=LW=L 
FIGURE 12. Draw Read/Modify/Write, Page Mode 
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The example system uses a display “warp” of 64 words, 
which avoids the requirement for VRAM transfers in the 
middie of scan lines. So VRAM transfer accesses occur 
only at the beginning of scan lines, during blanking time, and 
well before active video time. 


The VRAM performs a transfer when DT/OE is Low before 
RAS is asserted. The BSM asserts DT/OE in state 16, and 
then asserts RAS in state 17, as shown in Figure 13. The 
remainder of the access is conventional except that the out- 
put RFIP is asserted. This tells the VPC that CAS outputs to 
all planes must be asserted. 


5.6 VRAM Refresh Access 


When the Refresh timer causes a bus request, by setting 
RFRQ, and the current bus access has completed by reach- 
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ing state 0, the BSM will initiate a VRAM Refresh access. 
This uses the VRAM’s internal refresh row counter by as- 
serting CAS Low before RAS. 


The BSM inserts RGP wait states by holding RDY Low until 
the refresh access is completed—the pending RGP bus ac- 
cess can then continue. RFIP is asserted during the refresh 
so that the VPC will assert CAS to all planes. 


Figure 14 shows the BSM’s behavior. 


6.0 THE VIDEO PLANE CONTROLLER 

The function of the Video Plane Controller (VPC) is to en- 
able, under program control, some or all of the planes for 
Drawing operations, and to control the data transceivers on 
each plane. 
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Conditions for VRAM Transfer: BS1 = H, BSO = H, RGPRQ = H, RFRQ = xX,R=L,W=H 
FIGURE 13. VRAM Transfer (Preceded by Draw Access) 
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Pending RGP access continues 


Conditons for VRAM Refresh: RGPRQ = H, RFRQ = H, BS1 = Lor BSO = L 
FIGURE 14. VRAM Refresh 


Plane Memory Control 
Several control methods are possible: 


¢ Disabling inactive planes by programming their BPUs to 
the “Destination = Destination” function. These planes 
therefore write back their images without any changes. 
This method does not require additional hardware, but 
does not allow the use of interplane Bitbit (there will be 
bus contention during Source reads). 


Steering WR to selected planes, so that only active 
planes can change their data. This method also does not 
allow the use of interplane Bitblt. 


Steering RAS to selected planes. This method could in- 
cur extra delay in the RAS pulse, which in many designs 
can not be tolerated. Also this method is not guaranteed 
to work for all VRAMs. 


Steering CAS to selected planes. This method does work 
satisfactorily, and any extra delay in asserting CAS can 
usually be tolerated. 
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CAS steering is the method used in the example system. By 
using the CAS timing strobe, NCASIN, from the BSM prior to 
clocking on the rising edge of PH1, no additional delay in 
CASn is incurred. 


The VPC must be programmed (by an Operand Write from 
the RGP to address E00007 in the example system) for 
which plane or planes are to be active—the ‘Write mask”, 
and if an interplane Bitblit is required. Four RGP data bits, 
DO-3, correspond to the four plane’s Write Mask, and D15 
is used for the interplane bit. The signal (LMASK) which 
latches these bits in the VPC is generated by the Address 
Decoder logic. 

During drawing operations, except for interplane Bitbit 
Source Reads, plane n is enabled (NCASIN from the BSM is 
routed to CASn and clocked on the rising edge of PH1) if its 
corresponding Write Mask bit is a 1. 


During linear accesses (BS1 is Low) and the interplane 
Bitblt Source Reads (BSE and BS1 are High, BSO is Low, 
and the interplane bit is 1) a plane is enabled if the RGP’s 
address lies within the plane’s address bounds. 


During VRAM Transfer accesses and VRAM Refresh opera- 
tions (RFIP is High) all CAS outputs need to be asserted for 
the duration of RFIP. 


Thus in the example system the VPC needs to decode ad- 
dress lines A17, 18, 22 and 23 to detect the plane being 
addressed, and decode RFIP, BS1, BSO, BSE and the inter- 
plane bit. 


It should be noted that during interplane Bitblt Source 
Reads, however, BS1, BSO and BSE are not valid during 
state 8; at this time they will change for the next RGP ac- 
cess. Therefore the VPC must effectively latch their value 
during these accesses. 


Plane Transceiver Control 


The VPC also generates the Enable (BEn) and Direction 
(DIRn) control signals for the transceiver on each plane. 
The sense of DIR is that when it is High data is moved from 
the VRAM to the common bus (i.¢., for memory reads). 


During linear accesses to VRAM (BS1 and A23 are Low and 
A22 is High) BEn is asserted according to which plane is 
being addressed (a decode of A17 and A18). DiRn is the 
inverse of RGP’s R output. 


During linear accesses to the BPU Control registers (CRE is 
Low), all BEn are asserted and DiRn is Low. 


During linear accesses to the BPU Function Select registers 
(FSEn is Low), BEn is asserted according to which one or 
more registers is being accessed and DiRn is Low. 


During Interplane Bitblt Source Reads all BEn are asserted. 
DIRn for the Source plane (the plane being addressed by 
the RGP) is High, and for the other planes (which will latch 
the Source plane’s data in their BPUs) DIRn is Low. 


During all other Drawing accesses the transceivers are dis- 
abled (BEn are High). 
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Thus for transceiver control the VPC needs to decode ad- 
dress lines A17, 18, 22 and 23, and BS1, BSO, BSE and the 
interplane bit. The earlier comment about the validity of the 
RGP status signals during state 8 of the Bitblt Source Reads 
applies to the transceiver control logic as well. 


7.0 INTERFACING TO AN EXTERNAL PROCESSOR 


The open architecture of the Advanced Graphics Chip Set 
means that an interface to an external or co-processor can 
be achieved in a number of different ways, depending on 
the designer’s requirements. 


The technique offering the highest performance is that of 
using shared memory, where the external processor can 
take control of the graphics subsystem’s bus and access 
memory directly. 


The interface design described above requires three basic 
sets of signals from the bus controller. 


e Address Signals (Inputs) 
e Data Signals (Bidirectional) 
e Status Signals (Inputs, Plus RDY Output) 


The BSM design could easily be modified to cater for an 
external processor by adding an input which indicates a bus 
request by the additional processor. Once the BSM has 
granted bus access (putting the RGP in HOLD or inserting 
Wait states), this processor can present its Address, Data 
and Status signals to the BSM, VPC and Address Decoder, 
and access the memory. The BPUs can also be controlled, 
allowing both planar and pixel accesses by the external 
processor. 


8.0 SUMMARY 


The implementation of a memory interface for the National 
Semiconductor DP8500 Raster Graphics Processor is 
straightforward. The RGP provides sufficient status and tim- 
ing signals to interface flexibly and efficiently to any variety 
and mixture of memory and peripheral devices. 

This application note has described the implementation of 
an interface between the RGP and various types of memo- 
ry, by reference to the DP850EB system. 
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The LM1823: 

A High Quality TV Video LF. 
Amplifier and Synchronous 
Detector for Cable 
Receivers 


INTRODUCTION 


The LM1823 is a video I.F. amplifier designed to operate at 
intermediate carrier frequencies up to 70 MHz, and employ 
phase locked loops for synchronous detection of amplitude 
modulation on these carrier frequencies. The high gain, 
wide AGC range and low noise of the LM1823 make it ideal 
for use in television receivers, video cassette recorders and 
in cable TV set-top converters requiring high quality detect- 
ed base-band video and an audio intercarrier. Typical per- 
formance characteristics and features of this |/C are sum- 
marized in Table | below. 
TABLE | 


Maximum system operating frequency 70 MHz 
Typical |.F. amplifier Gain (45.75 MHz) >60 dB 
|.F. amplifier gain control range 55 dB 
True synchronous detector with a PLL 

Detector conversion gain 34 dB 
Detector output bandwidth 9 MHz 


Detector differential gain 2% 
Detector differential phase 1 degree 
Noise averaged AGC system 

Internal AGC gated comparator 

Reverse tuner AGC output 

DC controlled video detection passe 

AFC detector 


THE R.F. SIGNAL FORMAT 


Despite the wide variety of signal sources available to the 
home television receiver—broadcast, cable, satellite, video 
games etc—on channel carrier frequencies from 
55.25 MHz to 885.25 MHz, the spectral content of each R.F. 
channel has been established for many years. In the United 
States the channel bandwidth is fixed at 6 MHz with the 
picture carrier located 1.25 MHz from the lower end of the 
band, and an aural carrier placed 4.5 MHz above the picture 
(pix) carrier. Introduction of color television in the early fifties 
added another carrier, the chroma sub-carrier, positioned 
3.58 MHz above the pix carrier frequency. The pix carrier is 
amplitude modulated* by the baseband video signal (which 
* A more appropriate term is ‘negative downward modulation” since any 
modulating signal causes a decrease in the peak carrier amplitude (com- 
pared to conventional a.m., where the modulating signal alternately increas- 
es and decreases the peak carrier level with the mean carrier level remain- 


ing constant). For television carriers, syncs correspond to peak carrier and 
increasing brightness causes decreasing carrier amplitudes. 
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includes the synchronization information and the phase and 
amplitude modulated chroma subcarrier) while the aural car- 
rier is frequency modulated. Television channels in Europe 
use similar carriers with the refinement of a fluctuating chro- 
ma subcarrier phase (P.A.L.). 


The signal coming into the receiver has this general format 
and the receiver R.F. and I.F. circuits are designed to han- 
die such a signal and reduce it back to the baseband com- 
posite video and audio intercarrier. Even where signal 
scrambling is used to protect the video modulation from un- 
authorized detection, the R.F. spectrum must remain within ~ 
this format. For satellite broadcasts with frequency modula- 
tion of the video signal, the signal is demodulated:and then _ 
remodulated onto a low VHF channel for reception by stan- 
dard television receivers. In connection: with this, the 
LM1823 PLL detector is not suitable for wide-band FM de- 
tection—even though the I.F. carrier (70 MHz) is well within 
the LM1823 I.F. amplifier frequency capability. 


Notice again that the pix carrier is located at one end of the 
occupied bandwidth and only the upper sidebands are being 
fully transmitted. The lower sidebands are truncated with 
only frequencies close to the pix carrier frequency modulat- 
ing the carrier. This method of conserving the frequency 
spectrum is referred to as vestigial sideband transmission. 


THE CABLE CONNECTION 


Originally introduced many years ago as a means for provid- 
ing broadcast TV to isolated areas or where the terrain 
made direct reception difficult, cable TV had modest growth 
in the U.S. and was a stagnating industry until the mid-sev- 
enties. Lower cost satellite earth stations were the turning 
point, allowing cable operators access to many varied pro- 
gram sources from any part of the country. 


TL/H/8421-1 
FIGURE 1. U.S. Broadcast Channel Spectrum 
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FIGURE 2. Broadcast and Cable Bands in the U.S. 
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FIGURE 3. Cable Set-Top Converter Block Diagram 


Standard television receivers in the U.S. tune to VHF chan- 
nels 2 through 13 and UHF channels 14 through 83, and 
initially cable operators used the 12 VHF channels for their 
program material. With increased sources soon all channels 
were occupied on some systems creating significant de- 
mands on television tuner and |.F. amplifier strips. More 
space yet was needed and rather than using UHF channel 
allocations starting at 470 MHz because of cable signal at- 
tenuation (typically 0.8 dB/100 ft. at 300 MHz), operators 
turned to the unused spectrum space between VHF channel 
13 and UHF channel 14. Naturally, since standard TV re- 
ceivers could not tune to these channels, the set-top con- 
verter came into being. Each of the new channels could be 
converted to a low VHF channel to be received on the stan- 
dard TV. Television manufacturers responded, and with the 
common introduction of varactor tuners were soon able to 
Offer ‘‘cable ready” televisions capable of tuning to all the 
new cable frequencies. This meant that customer conve- 
niences such as remote control of channel selection also 
became available. Unfortunately it aggravated a problem al- 
ready confronting the cable operator. Since standard televi- 
sion receivers couldn’t tune to the cable channels, opera- 
tors had been able to offer.premium services on some of 
these frequencies, paid for by subscribers who rented the 
appropriate set-top converter box. This didn’t prove very se- 
Cure since one operator’s ‘free’ channel was another oper- 
ator’s “pay” channel, and the introduction of cable-ready 
televisions ensured the eventual demise of such systems. 


Scrambling the signal, a technique already being used by 
over-the-air subscription television, has become common in 
the cable service. The degree of scrambling’ is limited since 
the scrambled signal spectrum must remain within the chan- 
nel allocation and anything done to the signal must be sub- 
sequently undone without noticable degradation of the sig- 
nal. 


Generally for television, scrambling means a pulse or sine 
wave suppression of the signal horizontal blanking pulse 
interval so that the sync-tips occur between the black and 
white levels instead of always below black level. The stan- 
dard television sync separator does not function well with 
this signal and the |.F./tuner AGC circuits will not work prop- 
erly, effectively scrambling the displayed picture. Other 
techniques include random inversion of the video informa- 
tion to provide an even greater degree of security. 


The means used to encode such a scrambled signal gives 
rise to the terms “in band scrambling” and ‘“‘out of band 
scrambling’. With cable ready television receivers capable 
of tuning to the scrambled channel, the decoder can be a 
simple broad-band gain switch (to change the signal R.F. 
amplitude during horizontal blanking) with a separate receiv- 
er tuned to the decoding data carrier frequency, which is 


*Other security techniques such as jamming or trapping are used but since 
jamming is easy to defeat and trapping requires removal or replacement of 
filters in the cable drop to individual subscribers, scrambling the signal is 
receiving a lot more attention. 
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located outside the signal channel. This permits use of the 
television receiver in a normal way but does require simulta- 
neous switching of the decoder receiver with channel 
changes. 


Also, spectrum space must be reserved for each scrambled 
channel’s data carrier. 


A more popular method of scrambling is “in band scram- 
bling” where the data carrier to decode the signal is includ- 
ed inside the transmitted signal channel, usually within the 
aural carrier. Any number of channels can be scrambled 
and now different levels of service can easily be added or 
deleted without the need to rewire the decoder box. This is 
achieved by including time multiplexed binary “tags” along 
with the sync information so that special programs can be 
identified. Individual subscriber boxes can be similarly ad- 
dressed and turned on or off by the cable operator. In these 
types of systems, the LM1823 and LM2889 have obvious 
applications. The LM1823 is able to provide an excellent 
baseband signal inside the decoder box, which signal is 
then remodulated on a low VHF channel carrier by the 
LM2889 for retransmission to the standard television receiv- 
er. Clearly, the highest possible performance is desireable 
to prevent any noticeable difference between a converted 


channel, whether scrambled or not, and a regular off-air 


broadcast channel. (For a complete description of the 
LM2889 modulator I/C see AN402). 


THE RECEIVER FRONT-END 


The typical receiver front-end consists of a tuner, I.F. ampli- 
fier, |.F. filters and a video/sound intercarrier detector stage. 
These circuits are designed to provide a number of func- 
tions: 


1) Select (tune) a specific R.F. channel in a band of fre- 
quencies. 


2) Provide rejection to adjacent and other channels in the 
band. 


3) Amplify low level R.F./I.F. signals prior to detection of 
the modulation. 


4) Avoid overload on high level R.F. signals. 


5) Trap or attenuate specific frequencies within the chan- 
nel bandwidth to ensure a proper detected frequency 
response is obtained. 


6) Linearly demodulate all desired modulating frequencies 
on the carrier. 


7) Produce a noise-free video signal at the detector. 


8) Provide automatic gain control (AGC) to compensate 
for changing signal strength at the receiver input. 


9) Provide automatic frequency control (AFC) to the tuner 
local oscillator (L.O.) to maintain the carrier intermedi- 
ate frequency (I.F.). 

N.B. Items 8) and 9) have previously been provided in part by circuits exter- 
nal to the conventional |.F. amplifier. However, these functions are 
completely included with the LM1823 leading to overall performance 
improvements and reduction in external parts count and cost. 
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F TL/H/8421-4 
FIGURE 4. R.F. Tuning and I.F. Conversion (Note High 
Side L.O. Reverses the Relative Position of the Picture 
Chroma & Sound Carriers. c.f. Figure 7). 


Although we are not directly concerned with the tuner de- 
sign in this application note, it is useful to understand the 
design goals and constraints on the tuner for at least two 
reasons. First, since the tuner and |.F. amplifier interact very 
closely to obtain and maintain a noise-free picture, we need 
to know something about the tuner in order to provide the 
correct gain distribution and AGC action. Second, when the 
two functions are finally placed together, we need to know 
where to look to solve visible problems that may have be- 
come apparent. In some instances, either the tuner or the 
|.F. amplifier may be at fault, and a good understanding of 
the system interaction is needed to ensure that the appro- 
priate action is taken. — 
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FIGURE 5. Typical Single Conversion Tuner 


Both single conversion and double conversion techniques 
are used in cable converter tuners. The single conversion 
type is similar to the conventional TV receiver tuner and 


consists essentially of an R.F. stage, mixer stage, and local 


oscillator. Usually some input filtering is done to help match 
the cable to the input device and provide some rejection to 
unwanted signals outside the operating channel. Further re- 
jection to unwanted signals, such as the |.F. frequency radi- 
ated back from the I.F. amplifier, is accomplished with inter- 
stage filtering between the R.F. amplifier and the mixer, and 
finally an output filter matches the mixer output to the cable 
feeding the I.F. amplifier. For convenience, we are assuming 


the desired output impedance is 752 and that the major I.F. 
amplifier frequency selectivity is determined by a block filter 
placed between the tuner output and I.F. amplifier input. 
This is consistent with modern practice using surface 
acoustic wave filters (SAWF’s) and high gain stabilized |/C 
amplifiers (LM1823). Even so, as noted in more detail later, 
the LM1823 does provide opportunities for more filtering at 
the I.F. amplifier output prior to the detector stage. 


Dual conversion tuners have been popular for a number of 
years and use first L.O. frequencies that are above the input 
R.F. bandwidth, avoiding problems with L.O. leakage back 
onto the feed cable. The second L.O. and mixer convert the 
high first |.F. to a Ch 3 or Ch 4 carrier for reception by the TV 
receiver. The addition of PLL’s to control the first L.O. and 
descrambling networks on the R.F. output have added suffi- 
cient complexity to such converters that they are now called 
“set-top terminals”. Also, since the scrambling techniques 
have become more sophisticated the signal is now frequent- 
ly converted down to baseband before decoding and re- 
modulation on Ch 3 or Ch 4 carriers. The high first |.F. has 
the advantage that image signal rejection is achieved with- 
out the switchable filters necessary at the input to the single 
conversion tuner. However, the absence of these filters 
does mean that care must be exercised to avoid generation 
of intermodulation products that ‘talk back” onto the cable 
(up conversion of the R.F. signal has been proposed as a 
way to minimize intermodulation components). Another dis- 
advantage of the dual conversion tuner shown in Figure 6 is 
that it typically has a very high Noise Figure, often between 
14 dB to 16 dB. This is because the signal is applied directly 
to the first mixer which is a passive, double balanced diode 
mixer. As discussed in more detail later when we look at 
SAWF’s between the tuner and the |.F. amplifier, a pre-amp 
in front of the mixer can improve the N.F. to 6 dB to 8 dB, 
especially in a baseband converter where an AGC voltage is 
available to help the tuner handle the input signal strength 
range. 


Returning to the single conversion tuner, the major parame- 
ters to be considered are as follows: 


1) Power gain 
2) Noise Figure 
3) Good Cross-Modulation rejection 
4) VSWR 
5) AGC Range 
6) Impedance changes with AGC 
7) Overload capability 
8) Channel 6 beat rejection 
9) Curve tilt (tracking) 
10) L.O. drift and radiation 


For an |.F. amplifier design, items 1), 2), 7), and 8) are the 
most significant, but if the tuner designer has overlooked 
the others we may see some problems when the tuner and 
|.F. amplifier are hooked together. oo 
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FIGURE 6. Dual Conversion Tuner 
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Crossmodulation describes the condition wherein the modu- 
lation information on an adjacent channel (usually) is trans- 
ferred on to the desired carrier. A typical specification is the 
undesired carrier level with 30% modulation needed to 
cause 1% modulation of the desired carrier level. 


Crossmodulation is particularly likely to occur in cable sys- 
tems and is usually observed as sync bars drifting through 
the picture. In particularly severe cases the interfering pic- 
ture can actually be seen. High signal levels at the input of 
the mixer are a frequent cause of crossmodulation, particu- 
larly when high gain R.F. stages are used to obtain a low 
tuner noise figure (N.F.). But when AGC is applied the 
crossmodulation source often shifts to the R.F. device. 


When overload occurs, (measured as the total harmonic 
distortion of a specified modulation frequency), the peaks of 
the R.F. carrier waveform become compressed and this will 
show up at the video detector as a smaller sync pulse ampli- 
tude (sync tip to black level). Since the AGC system oper- 
ates on the sync tip level the effective result is that the black 
level appears to go blacker than black—i.e. some near 
black information will be lost and the picture will appear to 
have too much contrast. Alternatively if the subsequent re- 
ceiver circuits have black level restoration the screen bright- 
ness increases and picture tube blooming on peak whites 
may occur. As overload increases there is a strong chance 
that vertical sync will be lost. Generally the tuner mixer de- 
vice is the first stage to overload, followed by the R.F. stage. 
While overload is caused by very strong signal strengths 
and therefore may appear to be of limited concern it can 
also occur at weak to intermediate signal strengths because 
of incorrect AGC threshold settings and this will be dis- 
cussed in detail later. 


Channel 6 beat is a phenomenon related to mixer overload 
and occurs because of the choice in the U.S. of 45.75 MHz 
as the intermediate frequency. On channel 6, mixing of the 
sound and pix carriers produces a signal at 171 MHz which 
is then mixed with the channel 6 L.O. frequency to give 
42 MHz. The I|.F. sound and pix carriers can also mix with 
the channel 6 L.O. to produce 42 MHz. Since 42 MHz is oniy 
170.455 kHz from the |.F. chroma subcarrier of 42.17 MHz, 
after detection wavey lines will appear in colored areas of 
the picture. Turning down the receiver color level (satura- 
tion) control will eliminate the 170 kHz pattern and identify 
the problem as Channel 6 beat. 


Curve tilt or tracking refers to the ability of the tuner filters to 
track the L.O. frequency as the channel selection is 
changed. Problems in this area are easily identified at the 
video detector output (sometimes referred to as the 2nd 
detector) since the effect is to cause changes in the relative 
amplitudes of the pix, sound and chroma carriers compared 
to that expected from the I.F. filter response. When the de- 
tector VCO and AFT circuits of the LM1823 are aligned to 
45.75 MHz, the chroma burst located on the back porch (or 
breezeway) portion of the horizontal blanking period in the 
video signal will normally be —6 dB compared to the sync 
pulse amplitude. If mistracking is causing a loss of high fre- 
quencies on certain channels, the burst amplitude will be 
lower on these channels and the picture (in severe cases) 
will have watery and noisy colored areas with smeared off 
picture detail. When the loss occurs down at the pix carrier 


0.5 mV TUNER FILTER 
———Pl 3048 = 30dB 


frequency, the burst amplitude is increased and the picture 
will become harsh with excessive overshoots. 


Similar problems can occur on any specific channel simply 
due to mis-tuning or L.O. drift. In particular, as the L.O. fre- 
quency drifts high and the chroma subcarrier amplitude in- 
creases, the sound carrier also increases and chroma/ 
sound beats will appear in the picture. In the U.S. the chro- 
ma/sound carrier beat is at 920 kHz (4.5 MHz—3.58 MHz) 
and appears as a herringbone pattern while the audio mod- 
ulates the sound carrier. This 920 kHz beat can also be 
caused by detector non-linearities, and after the video de- 
tector by the detected 4.5 MHz sound intercarrier mixing 
with the chroma subcarrier in subsequent receiver stages. If 
turning down the color level control removes the 920 kHz 
beat then a better 4.5 MHz trap is needed at the video de- 
tector output. 


These preceding comments are not meant to imply that the 
tuner is the root cause of all the nasty phenomenae that can 
be observed in the picture display. Overload, Channel 6 
beat and video noise are very dependent of the 
tuner/|.F./AGC interaction. To understand why this is the 
case, we need to look at the demands that the input signal 
field strength puts on the system. 
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TL/H/8421-7 
FIGURE 7. Typical Tuner Gain and Noise Figure 


INPUT SIGNAL LEVELS 


The smallest input signal is, of course, no signal or simply 
the noise level generated at the cable drop. To this noise 
level will be added the input noise of the tuner itself, giving 
rise to an equivalent noise input defined by the tuner noise 
figure (N.F.) While a specific design will have to take into 
account the actual operating parameters of the tuners avail- 
able, we will assume a typical tuner configuration with an 
R.F. stage providing 14 dB gain and having a 4 dB N-F., 
followed by a mixer stage with 16 dB conversion gain and a 
16 dB N.F. The N.F. of this combination is 6 dB, a fairly 
typical number, which will have the effect of increasing the 
actual input noise by a factor of 2. If our noise source is the 
cable impedance with a real part of 75, at an ambient 
temperature of 290k, then the equivalent input noise is 2.2 
uVrms (the noise contribution of any matching network or 
cable termination is ignored as this is included in the tuner 
N.F.). 


Cable signal levels run from —6 dBmV to +15 dBmvV witha 
typical system goal of maintaining a C/N ratio of at least 
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FIGURE 8. System Gain Distribution 
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43 dB at the cable drop to the subscriber. If a 0.6 mVrms 
signal is to produce the rated detector output of 3V (o-—p)* 
for the LM1823 then we need a total system gain of at least 
75 dB. Usually the SAWF connected between the tuner out- 
put and the I.F. input will have an insertion loss of 20 dB to 
30 dB so that with the 30 dB tuner gain, the I.F. amplifier/ 
detector is required to provide the remaining 76 dB. If the 
tuner is simply a diode mixer with a 6-8 dB insertion loss, 
the gain requirement increases to 114 dB. 

*(o-p) means the detected zero carrier voltage level to the detected sync tip 
voltage level. The actual peak white signal to sync tip excursion at the detec- 
tor will be 87.5% of this—2.63V (p-p). In the absence of a carrier, thermal 
noise will be present with amplitude peaks on both sides of the detected 
zero carrier voltage. 

Fortunately the LM1823 has a high conversion gain detec- 
tor (84 dB) and the I.F. amplifier gain can be set to well over 
75 dB at 45.75 MHz (but we will see that some gain prior to 
the |.F. amplifier filter will be necessary if a good system 
N.F. is desired). Substantially more gain than necessary 
should be avoided however, even though there is plenty of 
AGC range in the |.F. amplifier (from 48 dB to 60 dB de- 
pending on external components). While at least 22 dB 
AGC capability is needed to accommodate the expected 
input signal strength range, if excessive system gain is used, 
forcing the |.F. amplifier into early gain reduction, the |.F. 
amplifier N.F. will begin to increase. With a diode mixer front 
end, the |.F. amplifier N.F. may contribute directly to the 
system N.F. and prevent noise-free pictures from being ob- 
tained. If a pre-amp or tuner is part of the AGC loop, gain 
reduction should be limited to the |.F. amplifier as much as 
possible, transferring gain reduction to the tuner only when 
the signal strength is high enough to cause distortion or 
cross modulation problems. The tuner gain will prevent the 
prior increase in |.F. amplifier N.F. from impacting the sys- 
tem noise performance, but excess system gain causing 
premature tuner gain reduction will increase the tuner N.F. 
and hence the system N.F. 


Of great interest to us is the R.F.C/N ratio required for the 
detected output to be considered noise free. Actual televi- 
sion video S/N ratios are a little complicated by the fact that 
the displayed video signal does not occupy the full R.F. car- 
rier envelope. 25% of the carrier is reserved for the syn- 
chronizing pulses and 12 14% is retained even under condi- 
tions of peak white modulation, for the benefit of intercarrier 
sound detectors. A common definition of the video S/N ratio 
is the ratio measured in decibels of the peak video signal 
amplitude to the r.m.s. noise voltage amplitude. In this. con- 
text peak video refers to the voltage excursion between 
black and white levels (from 75% peak carrier to 121/2% 
peak carrier). With this definition in mind, it is generally ac- 
cepted that the subjective effect of imperceptible noise oc- 
curs at an S/N ratio of 43 dB. Noise will become perceptible 
(for most viewers) at an S/N ratio around 38 dB; is clearly 
visible but not necessarily disturbing at 34 dB and becomes 
objectionable at 28 dB to 30 dB. Alternatively if we measure 
the signal amplitude as an r.m.s. sine wave with the same 
peak to peak amplitude as the R.F. carrier during the sync 
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TL/H/8421-9 
FIGURE 9. Television R.F. Modulation Envelope 
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pulse period, our signal is free of noise for a 47 dB C/N 
ratio. a 


If the input signal were completely noise-free (i.e. no excess 
noise from head-end amplifiers etc.) then the detected C/N 
ratio is determined by the equivalent input noise level of the 
tuner—2.2 uVrms for a 6 dB N.F. With a minimum signal 
level of 0.5 mVrms the detected C/N ratio will be 47 dB for 
the converter alone. When the actual signal has noise, for a 
cable C/N ratio of 43 dB the noise detected at the converter 
output is now 
Gn = 10-6 V(2.2)2 +. (3.5)2 = 4.13 pV 

This gives a detected C/N ratio of 41.6 dB, a loss of 1.3 dB 
compared to the original signal. For most viewers this is the 
just perceptible level for video noise. On the other hand, if a 
14 dB N.F. converter is used, the detected C/N is 32.7 dB 
which is considered objectionable. A 0 dBmV signal would 
produce 38.7 dB C/N ratio which would be acceptable. Ob- 
viously a low N.F. is important, and any increases in N.F. 
should be carefully controlled to get the best picture quality 
possible. Figure 10 shows the change in N.F. for the 
LM1823 |.F. amplifier. For over 30 dB gain reduction, the 
N.F. is unchanged and increases by only 4 dB for the next 
20 dB of gain reduction. 
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| TL/H/8421-10 
FIGURE 10. Increase in !.F. Amplifier 
N.F. with Gain Reduction 


LM1823-GENERAL CIRCUIT DESCRIPTION 


The basic arrangement of the LM1823 is shown in 
Figure 11. A five stage |.F. amplifier provides gain with a low 
impedance input stage to ensure adequate suppression of 
triple transit echo in SAW filters, and AGC on the three inter- 
stages. The output stage buffers the |.F. signal which is split 
off into two paths. A linear path takes the modulated signal 
to a true synchronous detector while a high gain limiter am- 
plifier passes the |.F. carrier waveform to a second phase 
detector which is part of the PLL for the VCO. The PLL has 
an externally adjustable filter and locks the oscillator in 
quadrature with the incoming |.F. carrier. An in-phase com- 
ponent of the oscillator also drives the linear path detector 
to recover the signal amplitude modulation. An external DC 
control allows fine adjustment of the detection phase in or- 
der to optimize the detector linearity. The output from the 
detector is coupled back into the AGC comparator input, 
and is internally gated during the sync pulse period for good 
nose immunity and a fast response. Two AGC voltages are 
available; an early AGC for the |.F. amplifiers and a late, or 
delayed AGC for the tuner. The take-over point between the 
|.F. AGC and the tuner AGC is set by an external potentiom- 
eter. Also included is an AFT output for fine control of the 
tuner L.O. All these functions are contained in a 28-pin DIP 
with a pin-out designed to facilitate stable p.c.b. layouts— 
even with the high system gain of the LM1823 at frequen- 
cies up to 70 MHz. 
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FIGURE 11. Block Diagram of the LM1823 


i.F. Amplifier Stages: 


The LM1823 I.F. amplifier is composed of five separate 
stages designed to provide high gain primarily in the fre- 
quency range of 35 MHz to 60 MHz, and gain control over a 
60 dB range without overload of any stage and without intro- 
ducing excess noise into the signal. 


To achieve this, AGC is applied to the second through 
fourth stages by a control voltage that is either internally 
generated from the video detector output or from an exter- 
nally applied bias voltage at Pin 13. AGC action starts when 
the voltage at Pin 13 reaches approximately 4 VDC and 
over 50 dB of gain reduction is obtained by the time Pin 13 
voltage reaches 6.5 VDC. For a typical application, the I.F. 
noise figure is around 6 dB for the first 30 dB of gain reduc- 
tion, and then begins to increase to above 10 dB by the time 
the amplifier is gain reduced over 50 dB (see Figure 10). 


As mentioned earlier, the total system gain desired from the 
1.F. amplifier input to the video detector output needs to be 


6 DC FEEDBACK 


15dB MAX 
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selected for a specific set of tuner parameters and I.F. filter 
losses. Excess gain simply means premature AGC action 
with possible loss of optimum video S/N ratios. To see how 
and where the LM1823 gain can be adjusted, we will look at 
each gain stage in turn. 
Input Stage: 
The input stage is a common-base differential amplifier de- 
signed to give good rejection of unwanted I.F. output and 
detector VCO signals that may be radiated back to the in- 
put. The low input impedance of 609 ensures that SAW 
filters are terminated sufficiently to keep the TTE better than 
40 dB below the signal level, even with low impedance 
SAWF’s. Because it is a common base stage, the input 
stage gain is determined by the source impedance present- 
ed to the input. An approximate expression for the gain is 
given by Equation (1) 

Av = 531/(Zs + 60) (1) 
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FIGURE 12. Gain Distribution in the 1.F. Amplifier 
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FIGURE 13. |.F. Amplifier Gain Reduction Characteristic 


As an example, if we use a high impedance SAW filter such | 


as the Murata SAF45 MC series with an output impedance 
that can be modelled as a 2.8 kOQ, resistor in parallel with 
8 pF capacitance, our input Zs is 3450 (including 2 pF input 
stray capacitance) at 45.75 MHz. From (1), the input stage 
gain is 2.4 dB. If a filter is used that matches to the input 
stage with 600, then the gain can be as much as 13 dB. 


A balanced input is extremely important since the input 
leads Pins 6-9 are the most sensitive parts in the system to 
unwanted I|.F. coupling. For example, if the |.F. output cou- 
ples into these pins it can cause changes in the frequency 
response and can easily promote oscillation. A spectrum 
analyzer is invaluable for helping determine the system sus- 
ceptibility to this phenomenon. With the input terminated by 
the I.F. filter (or an equivalent resistor), the I.F. amplifier 
output noise spectrum will show if oscillation is likely 
to occur. Pa | 


Another signal that can appear at the input is the detector 
local oscillator waveform. Unlike quasi-synchronous detec- 
tors, the LM1823 has a constant (and relatively high) oscilla- 
tor signal for good linear detection, even with low input sig- 
nal levels. It is the balance between the input pins to the 
VCO radiation pick-up that will determine whether the p.c.b. 
layout is good enough. VCO pick-up can cause AFC skew- 
ing and assymetrical oscillator pull-in, but probably the most 
serious effect is failure of the oscillator to acquire lock at 
weak signal levels. This is caused by the fact that the PLL 
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phase detector sees two input frequencies—the desired I.F. 
and the undesired L.O. frequency. As a result the L.O. 
“chases itself’ and is driven outside the loop acquisition 
range. 


Again the spectrum analyzer is a useful tool for measuring 


the level of VCO pick-up and the degree of improvement 


that any circuit modification or component relocation 
makes. A good layout will have symmetrical input leads 
placed as close together as possible, shielded input coils 
(where used) and external components mounted as close to 


the 1/C as possible. The DC feedback decoupling capacitor 


connected between Pins 6 & 9 should be right against the 
pins. The pcb layout shown later, even though it uses an |/C 
socket, is able to keep the equivalent VCO input level to 
under 2 uVrms. To put this number in perspective, it is 
~97 dB compared to the original VCO level. For the mea- 
surements, the spectrum analyzer should be connected 
through a FET probe at the I.F. output, which is disconnect- 
ed from the detector stage. The VCO control pin is ground- 
ed, the detector input is de-coupled with a 0.01 uF capacitor 
to ground, and a reference signal CW of the order of 
100 uVrms is applied at the filter input. 


Second and Third Stages 


These are easy to handle since they are completely self 
contained within the LM1823. The maximum gain is fixed at 
17 dB each with 26 dB and 20 dB of gain reduction capabili- 
ty respectively. 


Fourth Stage 


Unlike the preceding stages, the emitters of the fourth differ- 
ential amplifier are available at Pins 3 & 4. An internal resist- 
ance of 13602 between these pins sets the minimum stage 
gain at 4 dB, and under these conditions (Pins 3 & 4 open) 
the stage does not provide significant gain reduction with 
AGC action. However, when an external resistor is connect- 
ed between the emitters, the gain increases. For Pins 3 & 4 
shorted together the gain is as much as 18 dB and the stage 
can provide up to 14 dB gain reduction with AGC action. 
Because of the way in which the total |.F. amplifier gain 
reduction is shared bciween the stages, the effective gain 
increase obtained by a resistor between Pins 3 & 4 occurs 
only for signals below the AGC threshold. After 20 dB of 
system gain reduction the fourth stage is fixed at 4 dB. 


Fifth Stage and I.F. Amplifier Output 


The fifth and final |.F. amplifier stage has a single-ended 
output. There is no internal connection to the detector 
stage, permitting convenient isolation of the IF amplifier and 
detector functions. Pin 1 is also a point at which any addi- 
tional signal filtering may be applied. A resistive load con- 
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FIGURE 14. Low impedance Common Base Input Stage 
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FIGURE 16. Fourth I.F. Amplifier Stage 
Gain with External Resistor 


nected to the 12V power supply can be used, but the maxi- 
mum value is limited in practice to less than 50002. at inter- 
mediate frequencies because of stray p.c.b. capacitance 
and the loading of the detector stage input impedance of 
3 kQ. The stage gain for a total load impedance of Z is given 
by Equation (2) 

AV = 121/48 (2) 
The last part of the |.F. amplifier concerns the power supply 
input at Pin 5. This is a shunt regulated input with a nominal 
value of 6.3V and the I.F. amplifier current is delivered 
through a dropping resistor from the 12V rail supplying the 
remainder of the I/C. The 0.01 uF ceramic r.f. decoupling 
capacitor at Pin 5 should be grounded through very short 
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FIGURE 17. |.F. Amplifier Voltage 
Regulator Current Requirement 


leads—preferably on the copper side of the p.c.b. A nominal 
current level into Pin 5 is 32 mA, set by a 1800 resistor. This 
current should not exceed 60 mA and the minimum current 
is about 20 mA, below which the |.F. amplifier will start to 
lose gain as Pin 5 voltage drops below the regulated level. 


SELECTING THE I.F. GAIN 


Clearly the LM1823, with all the gain provided by five |.F. 
amplifier stages and with 34 dB detector conversion gain, 
has a more than adequate gain margin to provide signal 
sensitivity and compensate for interstage filter losses. To 
show how this gain may be distributed we can look at a first 
cut design example. 
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If we continue with the 30 dB gain tuner with a 6 dB N.F., 
using the tuner 75 output to misterminate the SAWF input 
will produce a very high insertion loss for the filter. This can 
easily be over 30 dB but before using the LM1823 gain ca- 
pability to compensate for this loss, we must look at another 
aspect of filter insertion loss—the N.F. goes up Previously 
we assumed that the tuner N.F. will dominate the system 
N.F.—and with a tuner amplifier N.F. of 6 dB and 30 dB gain 
this is indeed true. But when the I.F. amplifier and SAWF are 
combined the N.F. for the combination exceeds 30 dB. This 
degrades the system N.F. to 7 dB* and after 50 dB of I.F. 
amplifier gain reduction the N.F. will be over 8 dB. Frequent- 
ly this will be alright but it is instructive to consider improving 
the SAWF N.F. by matching the tuner output impedance to 
the filter or using an impedance matching pre-amp. For ex- 


ample, the 10 dB gain pre-amp shown in Figure 18 has a 


4 dB N.F. and reduces the filter loss to less than 20 dB. 
After 50 dB I.F. amplifier gain reduction, the combined N.F. 
is only 27 dB—for a worst case system N.F. of 6.6 dB. Ina 
dual conversion system with a diode mixer (and already high 
N.F.), some gain must be provided prior to the SAWF. 


NFiF 
(Tuner Gain) 


Leaving a 10 dB gain margin over that required to raise a 
—6 dBmV signal to the rated detector output, the total gain 
requirement of the |.F. amplifier is 


75.6 dB — 30dB + 30dB — 34dB + 10dB = 51.6 dB 
(0.5 mV —>» 3V) (tuner) (SAWF) (detector) (gain margin) 


(With a 10 dB gain impedance matching amplifier between 
the tuner and the SAWF, the gain requirement falls by 20 dB 
to 31.6 dB.) To avoid overload in the high gain tuner, we 
probably have to start gain reducing the tuner when the 
input signal reaches +10 dBmvV (but certainly not before 
O dBmvV in order to preserve the tuner NF) so that the I.F. 
AGC range requirement is approximately 26 dB. This 
amount of AGC range can be obtained without a resistor 
connected between Pins 4 & 5 putting the fourth stage gain 
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TL/H/8421-19 
FIGURE 18. impedance Matching Pre-amplifier 


at 4 dB. The SAWF impedance sets the input stage gain at 
3 dB for a total of 41 dB to the input of the final stage. A 
180 resistor at Pin 1 gives the desired last stage gain of 
11 dB, or this resistor is reduced to 509 and a 10 dB pad is 


inserted between the I.F. amplifier output and the detector 


. input when a pre-amp is used. 


LM1823 VIDEO DETECTOR 


The second major function of the LM1823 is the video de- 
tector stage, including the AFT/AFC detector and AGC de- 
tector/amplifier. ; 


The video detector stage of the LM1823 has a fixed conver- 
sion gain of 34 dB—giving a 60 Vrms input level for a 3V 
(o-p) detected output. This input level is required for AGC 
action to commence and is well below the input level that 


' can cause intermodulation or catastrophic overload. 


Synchronous detection of an amplitude modulated carrier 
involves a source of constant amplitude CW with the same 
frequency as the signal carrier, and two phase detectors. 
One detector is operated in quadrature—i.e. the CW phase 
and the signal carrier phase have a 90 degree difference at 
the inputs to the phase detector. This detector operates 
solely to keep the CW source phase-locked to the signal 
carrier. The second phase detector has synchronous or in- 
phase inputs so that the detector output responds to the 
amplitude difference between the inputs and therefore 
tracks the signal amplitude modulation. 


_ The benefits of synchronous detection over envelope de- 
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tection are well known, and most modern receivers incorpo- 
rate a type of detector known as a quasi-synchronous de- 
tector, which is a signal amplitude detector. The I.F. signal is 
amplified and stripped of modulation in order to:be used as 
the detector CW. The disadvantages of this type of detector 
are the loss of linearity at very low signal inputs (corre- 
sponding to peak video modulation) and’a fundamental 
compromise in the bandwidth of the limiter stage used to 
strip the modulation. To maintain ease of tuning and a rela- 
tive immunity from center frequency drift caused by temper- 
ature changes and aging, the limiter bandwidth is sufficiently 
wide that the resulting CW is phase modulated by the infor- 
mation on the original I.F. carrier. Since this can generate 
intermodulation products, a high Q is desirable and a trade- 
off in ease of alignment occurs. 


A less obvious problem with this type of detector is the actu- 
al static detection phase that is being regenerated. Internal 
I/C related phase shifts cause the limited carrier waveform 
applied to the detector to be more or less than 0 degrees 
phase-shifted with respect to the signal carrier phase. A 
loss in detector efficiency results, but if the limiter tuning is 
adjusted to compensate for this, the CW phase from the 
limiter will depend on the drive to the limiter. The detection 
phase then changes with amplitude modulation of the origi- 
nal |.F. carrier. The.effect of this is observed primarily as 
differential phase in the chroma subcarrier signal and _in- 
creased levels of sound buzz. Although, as discussed later, 
the desired phase difference between the detector CW and 
signal carrier is not necessarily 0 degrees, the limiter tuning 
cannot be used to correct the amplitude modulation detec- 
tor phase—the limiter must be center tuned to avoid carrier 
phase shifts with modulation level. 


The LM1823 overcomes these problems by providing a true 


synchronous detector system, which, as the block diagram 


shows, comprises of an internal VCO and in-phase and 


quadrature phase detectors. The incoming signal from the 
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FIGURE 19. Limited I.F. Carrier Phase Shifts with Input 
Amplitude when the Limiter Tank is Mistuned 
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TL/H/8421-21 
FIGURE 20. LM1823 Synchronous Detector and DC 
Controlled Detection Phase 


|.F. amplifier is split into two paths. One path is through a 
high gain limiter stage which strips the amplitude modulation 
from the CW and applies it to one input of the quadrature 
phase detector. The other detector input is from the VCO 
and, once synchronized to the intermediate frequency, if the 
VCO phase deviates from a 90 degree relationship with the 
limiter CW phase, a control current is generated by the 
phase detector and is filtered at Pin 18 to correct the VCO. 
Even though the limiter stage tuned circuit faces the same 
compromises of desired narrow bandwidth versus ease and 
stability of tuning, the filter at Pin 18 can be made to have a 
very narrow bandwidth. Therefore the VCO can provide a 
reference signal to the phase detectors with a high degree 
of spectral purity. The second path for the |.F. signal is di- 
rectly to the in-phase detector. The VCO output passes 
through a DC voltage controlled phase shifter before being 
applied to this detector. The DC.phase shifter allows precise 
adjustment of the synchronized VCO phase for maximum 
amplitude modulation detection efficiency, and compen- 
_ Sates for any internal I/C phase shift variations. At the same 
time, proper center-tuning of the limiter coil is possible. 


The benefits of center-tuning the limiter are clearly shown 
by comparing the differential chroma phase of the LM1823 
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TL/H/8421-22 
FIGURE 21. Relative S/N Ratio with 
Limiter Tuning (No SAWF) 


with a conventional quasi-synchronous detector. The 
LM1823 can consistently produce DP’S of under 1 degree 
compared with up to 10 degrees for a quasi-synchronous 
detector. There is also a substantial improvement in the 
sound carrier S/N ratio. When the limiter is detuned to com- 
pensate for internal |/C phase shifts or for detection phase- 
lags to produce video overshoots (for a subjectively crisper 
picture), the S/N ratio degrades by 5 dB to 7 dB, depending 
on the video modulating signal. 
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FIGURE 22. Effect of Limiter Retuning with SAWF 


THE LIMITER 


The limiter tuned circuit at Pins 24 and 25 is driven by a 
differential stage with a 6.6 kf. internal load impedance. A 
small signal gain of 50 (with a tuned circuit dynamic resist- 
ance of 8 K{) ensures that full quadrature detector efficien- 
cy is obtained with input levels above 10 mVrms, and inter- 
nal Schottky diodes limit the maximum amplitude at Pins 24 
and 25 to about 500 mV (p—p). Tuning is achieved either for 
a peak amplitude signal measured with an F.E.T. probe (low 
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capacitance) at Pin 24 or Pin 25 with a 10 mVrms CW input, 
or by monitoring the video detector and adjusting for mini- 
mum differential chroma subcarrier phase. The latter adjust- 
ment will require a signal source modulated with a chroma/ 
video ramp or stair-step pattern including a 20 IRE level 
chroma subcarrier, but does have the advantage that the 
adjustment can be made at strong signal levels, and does 
not require dis-connection of the tuner. 


AFT/AFC CIRCUIT 


The AFT phase detector is a doubly-balanced phase detec- 
tor with the switching signal provided internally from the lim- 
iter stage described previously. The quadrature signal input 
is obtained by light external capacitative coupling from the 
limiter tuned circuit to the AFT tuned circuit at Pins 23 and 
26. Parallel p.c.b. tracks to the limiter and AFT coils will 
usually provide sufficient coupling and the 1 pF capacitors 
on the LM1823 test circuit (see LM1823 data sheet) are 
shown only to illustrate the level of coupling involved. Since 
the AFT tuned circuit is driving an amplifier with a differential 
input resistance of 20 kQ, it is able to operate close to the 
unloaded Q of the inductor. 
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FIGURE 23. AFT Circuit with pcb Coupling 
Between the Limiter & AFT Tuned Circuits 


The AFT output Pin 27 is driven from a current source so 
that the output voltage at the proper center frequency is set 
by an external resistive divider network. The parallel resist- 
ance of this divider will determine the voltage swing ob- 
tained for a given frequency deviation and in combination 
with the AFT tuned circuit Q, provides a means to adjust the 
AFT output slope. 


Once outside the desired tuning range the AFT output volt- 
age should stay either close to ground (I.F. frequency high) 
or close to the positive supply voltage (I.F. frequency low). If 
the voltage moves back towards the center voltage as the 
signal moves further away from the desired tuning range, 
then more coupling from the limiter tank may be needed. 
Grounding Pin 26 through a 2 kQ. resistor will defeat the 
AFT circuit for receiver fine-tuning purposes. The 2 k©. pro- 
vides isolation of the AFT switch & associated cable from 
the tuned circuit which has a relatively low dynamic resist- 
ance of 1.8 k®.. Resistor values larger than 2 kX. may pre- 


vent the circuit from being defeated, but either Pin 23 or Pin 


26 can be grounded directly without damaging the 1/C. 
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FIGURE 24. AFT Circuit Output Voltage Characteristic 
(RLOAD at Pin 27 = 10k) 


THE PHASE LOCKED LOOP (PLL) 


For true synchronous operation the LM1823 has an internal 
VCO operating at the video intermediate frequency of 
45.75 MHz. 


A parallel tuned circuit between Pins 19 and 20 will set the 
oscillator free-running center frequency and the tuned cir- 
cuit dynamic resistance is loaded by an internal 1.5 kQ. re- 
sistor. Since the oscillator frequency must be controlled, a 
basic tradeoff exists between oscillator stability, control sen- 
sitivity and control range. To obtain a control range of over 
2 Mrz, the working Q of the tuned circuit should be around 
15. Increasing the Q by raising the capacitative arm of the 
tuned circuit will improve the oscillator stability. This reduc- 
es the change in free-running frequency as a result of tem- 
perature effects etc. The control sensitivity will decrease 
correspondingly and there will be a reduction in the control 
range. The control range in the application circuit has been 
chosen to cover the expected deviations in the |.F. carrier 
that are allowed by AFT circuits. With a coil unloaded Qu of 


MHz 


VOLTS (PIN 18) 


| —TLVH/8421-26 
FIGURE 25. VCO Control Sensitivity Characteristic — 


55, and a working Q of 15, the inductance should be 
0.24 uH, which tunes with 51 pF at 45.75 MHz. 


The V.C.O. frequency is adjusted by injecting a 60 mVrms 
CW at Pin 28. If the VCO tuning (Ls) is a long way from 
being correct, the detector output Pin 16 will show an AC 
signal of about 4V (p—p) centered around 7.5 VDC. As the 
oscillator is tuned toward the correct frequency the AC beat 
note will decrease and abruptly disappear as the oscillator 
locks to the carrier frequency. Final adjustment of the VCO 
is done by tuning Lg until the voltage at the phase detector 
filter Pin 18 is 4 VDC. 


Oscillator control is accomplished by internally phase shift- 
ing the currents in a direct cross-coupled differential stage 
in response to the control voltage developed at Pin 18. Di- 
rect cross-coupling of the bases and collectors of this differ- 
ential stage means that the transistors are operating in a 
soft-saturated mode, enabling a constant output amplitude 
to be obtained of about 500 mV (p—p). This output ampli- 
tude does not change with coil tuning or over the frequency 
control range of the oscillator. With the specified tuning 
components at Pins 19 and 20, the VCO sensitivity is 1.5 
MHz/volt. Other general characteristics of the VCO are a 
negative temperature coefficient of 150 ppm/degree C, and 
a tendency for the oscillator control sensitivity to decrease 
with decreasing frequency of operation (below 10 MHz). 


The VCO tuning components are mounted across the I/C 
package from the I.F. amplifier input. This minimizes induc- 
tive coupling and yields approximately 105 dB isolation for 
the I/C alone. Leads and components connected to the I.F. 
amplifier input will reduce the VCO isolation (as will higher 
operating frequencies). 
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FIGURE 26. LM1823 VCO Circuit 


The quadrature phase detector output is a push-pull current 
source so that the control voltage at Pin 18 is determined by 
the parallel resistance of the external divider network, which 
also sets the quiescent control voltage in the absence of an 
I.F. signal. This divider voltage should be centered at 4 VDC 
since the lower voltage swing for controlling the oscillator 
frequency is 2 VDC, and an internal clamp prevents Pin 18 
increasing above 5.6 VDC. By using a 20 kQ parallel resist- 
ance at Pin 18, the phase detector current of 7.5 uA/degree 
gives a phase detector sensitivity (1) of 0.15 volts/degree. 
This parallel resistance is equivalent to R1 in the conven- 
tional filter for a 2nd order PLL. The oscillator and phase 
detector sensitivities given above yield a DC loop gain of 
12.9 MHz2/radian. For the data sheet value of 1009 for R2, 
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and a filter capacitor of 0.1 uF, the loop damping factor (K) 
is 1.01 and the natural resonant frequency (w) is 32 kHz. 
From this we can calculate that the loop —3 dB bandwidth 
is 73 kHz which is substantially less than would be practica- 
ble with a quasi-synchronous detection system, and this 
brings the desired benefits of low luma/sound/chroma 
crosstalk and freedom from quadrature distortion produced 
by the |.F. filter slope characteristic in the vicinity of the 
picture carrier frequency. Nevertheless, some signal condi- 
tions may cause wider PLL bandwidths to be used. A proba- 
ble problem is incidental carrier phase modulation (ICPM). 


20 25 30 35 40 45 50 55 6.0 


VOLTS (PIN 22) TL/H/8421~28 
FIGURE 27. DC Controlled Phase Shifter Characteristic 


This describes the shift in carrier phase as the modulation 
depth changes, and is particularly likely to happen where 
prior processing of the original carrier waveform has oc- 
curred—in distribution or conversion amplifiers employed in 
MATV and cable systems for example. It is also present to 
an extent in broadcast transmitters and if the PLL loop 
bandwidth is too narrow for the VCO to track this phase 
shift, then the ICPM is transferred to the signal modulation. 
This can be observed as a tint shift in color bars or a smear 
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in the leading edge of a color bar as the VCO belatedly 
attempts to track the phase change. For these types of sig- 
nals it is desirable to increase the loop bandwidth to about 
500 kHz—changing R2 to 6802 is an easy fix. The loop 
damping factor is kept greater than 1 to avoid ringing on the 
phase transients. Larger loop bandwidths will increase the 
possibility of luma/sound/chroma crosstalk. - 


Once the VCO is locked in phase to the I.F. signal, the DC 
phase shifter Pin 22 is normally around 4 VDC for peak 
detector efficiency. Usually some extra phase lag will be 
introduced since a subjectively crisper picture is obtained if 
picture transients have an overshoot. Between 12% and 
20% overshoot without ringing is desirable, corresponding 
to a 400 mV to 800 mV shift in Pin 22 voltage. 


VOLTS (PIN 22) 
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FIGURE 28. Signal Overshoot Produced 
by Carrier Detection Phase Shift 
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FIGURE 29. LM1823 Detector and Video Amplifier 
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VIDEO DETECTOR POST AMPLIFIER 


The response of the video amplifier is rolled off above 
9 MHz to minimize the amount of the VCO waveform and its 
harmonics appearing in the output at Pin 16. Typical oscilla- 
tor products are 40 dB below the desired signal level. 


Zener diodes are used in the video amplifiers for level shift- 
ing so that the use of PNP transistors is avoided and the 
detector linearity is preserved. Excellent differential gain 
characteristics are obtained—typically less than 3%. Pin 16 
is a Darlington NPN emitter follower output. With no detec- 
tor CW input signal, Pin 16 is at 7.6 VDC, representing zero 
carrier level which is slightly higher than peak white 
(by 121%). As the CW input increases, Pin 16 voltage de- 
creases towards black level with the sync pulses producing 
the most negative detector level. 


The level reached by the sync tips is determined by the 
AGC loop threshold and if the internal AGC comparator is 
used (Pin 16 is directly connected to Pin 17), the sync tips 
will be clamped at 4 VDC. This produces a nominal detector 
output of 3.2V (p—p) but this is subject to variations in the 
Pin 16 detected zero carrier level. The resistive network 
shown connected between Pin 16 and Pin 17 in Figure 30 
can be used to change the zero carrier level at Pin 17 for an 
adjustable recovered video level. For best performance the 
recovered video level should never be less than 1V (p—p) or 
greater than 4V (p-—p). In suppressed sync systems, the re- 
covered video at Pin 16 is routed to the descrambler for 
restoration of the sync amplitude before it is applied to Pin 
17. Obviously the signal DC content must be preserved 
through the descrambler if proper AGC action is to be main- 
tained. : : 


AGC Self Gating Comparator (LM1823) . 


The AGC comparator input has a low pass filter to protect 
the AGC loop from noise interference. Conventional detec- 
tor systems often use noise gates to prevent the AGC sys- 
tem ‘backing off” on noise peaks that occur below the sync 
tip level. It is difficult to set the noise gate threshold close 
enough to the sync tip level for it to provide any benefit 
without risking AGC lock-out. For the LM1823 however, syn- 
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FIGURE 30. LM1823 Self Gating AGC Comparator 


chronous detection allows the noise gate to be eliminated. 
Since the noise is random phase, the synchronous detector 
will not rectify the noise voltage and the low pass filter can 
average out the noise input to the comparator. 

Further protection of the AGC comparator is provided by 
gating the comparator on only during the sync pulse period. 
The gate pulse is obtained from the input video waveform 


sync pulses at Pin 16. Essentially an emitter coupled sync. 


stripper circuit, the slice level is set by an external time con- 
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stant at Pin 14. During the sync pulse period the capacitor at 
Pin 14 is being charged toward ground potential and the 
comparator is gated on. Between sync pulses the capacitor 
discharges towards the positive supply voltage through the 
resistor and the comparator is off. The sync slice level is 
determined by the Pin 14 RC time constant and is given in 
Equation (3) as the number of millivolts the slice level is 
above the sync tip voltage. 


V SLICE = 1/2 RC (mV) (3) 
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FIGURE 31. RF AGC Amplifier 
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A typical slice level for a 3V (o-p) video signal is between 
100 mV and 250 mV. Different slice levels can be obtained 
with other capacitor values (the resistor should be left un- 
changed). Small capacitors will allow a faster response to a 


fluctuating sync tip level but also may cause the conse- 


quently deeper slice to include video overshoots. 
RE AGC DELAY AND OUTPUT AMPLIFIER 


The I.F. amplifier is at full gain below 4 VDC on Pin 13. At 


anywhere from 5.5 VDC to 6.5 VDC we will want to shift gain 
control into the R.F. stages, and this is accomplished by a 
delayed AGC threshold control at Pin 12. When the filter 
voltage on Pin 13 is 0.7V above the pre-set level on Pin 12, 
the R.F. AGC amplifier at Pin 11 will start to sink current. 


The capacitor shown connected between Pins 12 and 13 is 
optional and intended to provide an increase in AGC action 
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for signal amplitude transients at high R.F. signal levels (tun- 
er in gain reduction). AC changes on Pin 13 are coupled to 
the threshold level control-allowing the |.F. amplifier to gain 


reduce (or increase) during the signal transient. This hap- 


pens only during the signal change, so that the detected 
video returns more rapidly to the proper output levels. Once 
signal equilibrium is restored, the appropriate gain balance 
between the R.F. and |.F. amplifiers returns. 


CONCLUSION 


This note has described a high quality video |.F. amplifier/ 
detector combination that can provide excellent baseband 
video signals. A complete schematic of the external compo- 
nents required in such an application is shown in Figure 32, 
with a suitable p.c.b. layout in Figure 33. 
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FIGURE 32. LM1823 External Circuit Components. 
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FIGURE 33. LM1823 Printed Circuit Board Layout (Component Side) 
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LM2889 R.F. Modulator 


Introduction | 

Two |/C RF modulators are available that have been espe- 
cially designed to convert a suitable baseband video and 
audio signal up to a low VHF modulated carrier (Channel 2 


| through 6 in the U.S., and 1 through 3 in Japan). These are 


the LM1889 and LM2889. Both I/C’s are identical regarding 
the R.F. modulation function—including pin-outs—and can 
provide either of two R.F. carriers with dc switch selection of 
the desired carrier frequency. The LM1889 includes a crys- 


tal controlled chroma subcarrier oscillator and balanced | 


modulators for encoding (R-Y) and (B-Y) or (U) and (V) color 
difference signals. A sound intercarrier frequency L-C oscil- 
lator is modulated using an external varactor diode. The 
LM2889 replaces the chroma subcarrier function of the 
LM1889 with a video dc restoration clamp and an internally 
frequency modulated sound intercarrier oscillator. 


Modulation Parameters 


in the U.S., either of two R.F. channels is made available SO 
that the user can select a vacant channel allocation in his 
geographic area, thus avoiding co-channel problems with 


SOUND 
OSC 


RF 
OSCILLATORS 


antenna input and the tuner. 


National Semiconductor 
Application Note 402 
Martin Giles | 


older receivers that have inadequate shielding between the 


The characteristics of the R.F. signal are loosely regulated 


by the FCC under part 15, subpart H. Basically the signal 


can occupy the standard T.V. channel bandwidth of 6 MHz, 
and any spurious (or otherwise) frequency components 


‘more than 3 MHz away from the channel limits must be 


suppressed by more than —30 dB from the peak carrier 
level. The peak carrier power is limited to 3 mVrms in 750 
or 6 mVrms in 3000, and the R.F. signal must be hard-wired 
to the receiver through a cable. Most receivers are able to 
provide noise-free pictures when the antenna signal level 
exceeds 1 mVrms and so our goal will be to have an R.F. 
output level above 1 mVrms but less than 3 mVrms. Since 
the distance from the converter to the receiver is usually 
only a few feet, cable attenuation will rarely be a problem, 
but mis-termination can change both the amplitude and rel- 


ative frequency characteristics of the signal. 


The standard T.V. channel spectrum has a picture carrier 


located 1.25 MHz from the lower band edge. This carrier is 


amplitude modulated by the video and sync signal. In the 
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Modulation Parameters (Continued) 
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FIGURE 2. Television Channel R.F. Spectrum 


case of a color signal, a second subcarrier is added 
3.58 MHz above the picture carrier. The sound or aural carri- 
er is 4.5 MHz above the picture carrier and is frequency 
modulated with the audio signal to a peak deviation of 25 
kHz. This audio signal has pre-emphasis above 2.1 kHz (a 
75 ws time constant). Similar modulation methods and stan- 
dards are used in Japan and Europe. 


With the picture carrier located near one end of the channel 
bandwidth, most of the available spectrum is used by the 
upper sideband modulation components. Only modulating 
frequencies within 0.75 MHz of the carrier frequency are 
transmitted double sideband and the lower sideband is trun- 
cated by at least — 20 dB compared to the peak carrier level 
by the time the lower channel edge is reached. This is re- 
ferred to as Vestigial Sideband (VSB) modulation and since 
most R.F. modulators are double sideband, a VSB filter is 
used at the transmitter output. A filter is needed for each 
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channel and consists of bandpass and harmonic filter sec- 
tions. A broadcast transmitter uses a separate modulator for 
the sound carrier and this is added to the picture carrier via 
a diplexer before reaching the transmitting antenna. Close 
control is maintained on the picture and sound carrier fre- 
quencies to keep a 4.5 MHz spacing between them. This 
tight frequency control is used to advantage by the majority 
of television receivers which employ intercarrier sound cir- 
cuits. The |.F. amplifier processes both the pix and sound 
|.F. carriers and detects the 4.5 MHz difference frequency at 
the video detector stage. This frequency modulated sound 
intercarrier is then stripped of amplitude modulation by a 
high gain limiter circuit and a quadrature demodulator recov- 
ers the audio. 


The LM1889 and LM2889 use a slightly different modulation 
scheme to that described above for several reasons. For 
circuit economy L-C oscillators are used to generate the pix 
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FIGURE 3. Broadcast Transmitter Block Diagram 
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FIGURE 4. LM1889/2889 Sound and Video Modulation 


carrier frequencies. The stability of such oscillators is good 
enough for the AFT circuits in modern receivers to maintain 
picture quality, but if a separate L-C sound carrier oscillator 
were used, the relative drift of the two carrier frequencies 
would be much too great for intercarrier sound receivers. 
For example, a typical television sound circuit tuned to 
4.5 MHz will generate as much as 3% distortion if the differ- 
ence between the R.F. carriers changes by 15 kHz. Apart 
from the difficulty of setting the initial frequency with suffi- 
cient accuracy, it is unlikely that two L-C oscillators could be 
kept within 15 kHz of each other at 60 MHz to 100 MHz 
operating frequencies. However, when the audio signal is 
modulated onto a 4.5 MHz intercarrier oscillator frequency 
and this carrier is used to modulate the picture carrier, we 
have only the 4.5 MHz oscillator drift to worry about. 


A less obvious problem, but nevertheless significant if good 
audio quality is to be obtained, is incidental carrier phase 
modulation (ICPM). Even broadcast transmitters cannot 
maintain an invariant carrier phase as the modulation depth 
changes. Without feedback loops to control ICPM, a broad- 
cast transmitter can produce from 3 degrees to as much as 
30 degrees phase change as the carrier modulation de- 
creases from sync tips to peak white. While the separate 
sound carrier is unaffected by this ICPM of the pix carrier, 
on reception in the intercarrier sound receiver the phase 
shift with picture information is transferred onto the 4.5 MHz 
sound intercarrier. This results in a phenomenon known as 
sound buzz. Even with exceptionally careful p.c.b. layout, an 
i/C modulator with L-C oscillators can expect the pix carrier 
frequency to change with modulation depth. Fortunately, by 
modulating the sound signal as a 4.5 MHz intercarrier onto 
the pix carrier, the ICPM occurs equally in both R.F. carriers 
and will not be detected by the intercarrier receiver. 


Video Modulation 


The baseband input to the modulator is in an easily recog- 
nized composite format and this is a convenient point at 
which to introduce the I.R.E. scale. This is an oscilloscope 
scale divided into 140 units. The video portion of the signal 
representing the scene (picture) brightness levels will occu- 
py the 0 to 100 |.R.E. portion of the scale, with 0 |.R.E. as 
black level and 100 I.R.E. as peak white level. From 0 to 
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— 40 |.R.E. is the synchronization portion of the signal. The 
usefulness of this scale is that the standard composite vid- 
e0 signal will always have a sync amplitude that can be 
normalized to 40 |.R.E. Similarly the color burst amplitude is 
always 40 |.R.E. For a 1V (p-p) video signal, an 1.R.E. unit is 
equivalent to 7.5 mV. 


Although the video is amplitude modulated on the carrier 
waveform, the carrier amplitude only decreases from the 
unmodulated level. This contrasts with standard AM where 
the carrier level alternately increases and decreases about 
the unmodulated level. For a television signal, the peak un- 
modulated level corresponds to sync tip: level and increas- 
ing brightness levels cause decreasing carrier levels. To 
prevent complete suppression of the carrier (and conse- 
quent loss of the sound intercarrier in the receiver) the peak 
white signal is limited to a maximum modulation depth of 
87.5% of the peak carrier. Returning to our |.R.E. scale we 
can see that from peak carrier to zero carrier is equivalent to 
160 |.R.E. (140/0.875 = 160). One obvious consequence 
of this modulation scheme is that the video signal MUST BE 
dc coupled to the modulator. AC coupling will cause the 
peak carrier level to change with modulation scene bright- 
ness (standard AM) and the sync modulation amplitude will 
change. This spells trouble for the receiver sync circuits and 
the changing R.F. carrier black level will cause errors in dis- 
played ere nnees ate picture will “wash out” or disappear 
into black. 

The LM2889 uses doubly balanced modulator circuits with 
an L-C oscillator switching the upper transistor pairs. The 
signal is applied across the lower transistor pairs. If the sig- 
nal input pins 10 and 11 are at the same dc potential, the 
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Video Modulation (Continued) 
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FIGURE 6. LM2889 Video DC Restoration Clamp 


carrier is completely suppressed. As the offset voltage be- 
tween pins 10 & 11 is increased, the carrier output level 
increases. With a 752 output load resistor, the conversion 
gain of the R.F. modulator is 20 mVrms/volt. A de restora- 
tion circuit at pin 2 of the LM2889 allows the composite 
video to be ac coupled from the preceding stages, giving 
the designer flexibility in the video processing circuits (un- 
less an LM1886 is being used as a video source, it is unlike- 
ly that the composite video dc level will be correct, even 
with dc coupled video sources). On a 12V supply, pin 2 
clamps the sync tip of the video waveform to 5.1 VDC. 
Therefore, if we have a 2V (p-p) signal, one I.R.E. is equiva- 
lent to 14.3 mV and 160 I.R.E. is 2.29V. This is the required 
offset across the modulator input pins and since pin 11 will 
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be clamped to 5.1 VDC by the dc restorer circuit, pin 10 
should be biassed at 5.1V + 2.29V = 7.4 VDC. A look at 
the R.F. carrier output will confirm that now the syncs occu- 
py from 100% to 75% of the peak carrier, and that white 
modulates the carrier down to 1214% of the peak. To main- 
tain the proper modulation depth the clamp at pin 2 will 
track with supply voltage changes, allowing the bias at pin 
10 to be set with a resistive divider connected between the 
supply and ground. 


If the video signal polarity is reversed with positive syncs, 
either a dc coupled signal or an external dc restorer should 
be used that places the signal sync tip voltage towards the 
upper end of the common-mode input range at pin 11, 
which is 9 VDC with a 12V supply. Pin 10 is then offset 
below pin 11 voltage by the required amount for proper 
modulation. An input level of 2V (p-p) is optimal. Signal am- 
plitudes of less than 1V (p-p) are also useable but internal 
offset voltages and the potential for carrier feedthrough or 
leakage to the output stage may make it difficult to maintain 
good RF. linearity at peak modulation depths. Signal swings 
larger than 3V (p-p) should be avoided since this will pro- 
duce relatively large AC/DC current ratios in the modulator 
and the resulting modulator non-linearities can cause a 
920 kHz beat between the chroma and sound carriers. 


Although only one video input is required, the LM2889 has 
two balanced R.F. modulators and two R.F. carrier frequen- 
cy oscillators. Selection of the carrier frequency is by dc 
switching the supply voltage to the relevant oscillator tuned 
circuit. This automatically shuts off the other oscillator and 
modulator circuits. For test purposes when an output R.F. 
VSB filter isn’t used, or when only one carrier frequency is 
needed, the output pins 8 and 9 can be wired together with 
a common load resistor. Providing two channel operation 
with two independent oscillator/modulator circuits is much 
superior to using a single modulator and attempting to 
change carrier frequency by switching the tuning compo- 
nents of a single L-C oscillator. The latter method involves 
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FIGURE 7. LM2889 R.F. Modulator and Oscillator (one channel) 
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Video Modulation (continued) 


use of isolating diodes (if unbalanced operation with attend- 
ant feed through problems is to be avoided) and expensive 
trimmer capacitors for tuning the second carrier frequency. 
A further disadvantage is the need to switch the VSB filter at 
the R.F. output. 


The LM2889 oscillator configuration is the familiar cross 
coupled differential amplifier type, with level shifting zener 
diodes used to prevent the transistors from saturating with 
large oscillator output swings. The oscillator frequency is set 
by the tuned circuit components (f = 1/27 VLC), and the 
load resistors connected to the supply will set the oscillation 
amplitude and drive level to the modulators as well as deter- 
mining the circuit working Q. 


As might be expected, there are conflicting requirements on 
the practical range of working Q’s. A high Q is desireable 
from the viewpoint of stability, but higher working Q’s (set 
mainly by larger load resistors) increase the drive level to 
the modulator. Above 350 mV (p-p) the modulator will have 
attained full conversion gain and the R.F. output level will be 
determined by the amplitude of the video input signal. Un- 
fortunately increased drive levels will also increase the carri- 
er frequency second harmonic output from the modulator. 
Although a fully balanced design is used, parasitic capaci- 
tances on the emitters of the switching transistor pairs will 
rectify the oscillator waveform and this produces high levels 
of second harmonic. Load resistors much larger than 2400 
can produce a level of second harmonic matching the fun- 
damental. Since relatively small load resistors are required 
(much smaller than the tuned circuit dynamic resistance) 
the working Q will be dominated by these resistors. _ 

The acceptable degree of frequency stability will depend on 
the intended application, but L-C oscillators have proven to 
be adequate for most purposes. We can gain an idea of the 
frequency stability that is possible by considering the fre- 
quency drift produced by changes in the oscillator internal 
phase. A change in internal phase shift can be caused éi- 
ther by temperature or supply voltage changes but, as the 
M2889 data sheet shows, the supply voltage dependency 
is low. Between 12V and 15V the frequency is essentially 
constant and changes by less than 30 kHz over the entire 
supply voltage range. With temperature, the internal oscilla- 
tor phase shift changes by about 2 degrees over a 50 de- 
gree Celcius temperature range. If the tuned circuit Q is 15, 
then at 61.25 MHz (Ch 3 pix carrier) the oscillator frequency 
must change by —92 kHz to produce a compensating 2 
degree phase shift. If the Q is 30, then the ac would 
change by less than — 45 kHz etc. 


For high circuit Q, a large capacitance is desireable, but the 


inductor cannot be made too small if it is to remain the 
tuning element. This keeps the practical range of capaci- 
tance values to between 50 pF and 100 pF. Using a 75 pF 
capacitance, at 67.25 MHz the required inductance is just 
under 0.08 H and the working Q is 15 with 2400 resistors 


connected on either side of the tuned circuit to the supply — 


voltage. Depending on the coil type, the number of turns for 


this inductance will be from 11/ to 31 giving over 10 MHz 


tuning range. This is more than enough to compensate for 
component tolerance and variations ine overall internal 
phase lag from !/C to I/C. 


If better frequency stability of the carrier equeneys over that 
provided by an L/C circuit is needed, then crystal control of 
the oscillators can be used. It is necessary to retain the 
inductor, since a dc short is required across the oscillator 
pins to avoid a collector current imbalance off-setting the 


oscillator differential pair and preventing start-up. The induc- 
tor value is chosen to resonate with the capacitor in series 
with the crystal at slightly less than the desired operating 
frequency. About 20% less will allow the inductor to be fixed 
tuned. Close to its series resonant frequency (normally the 
3rd overtone) the crystal will provide the additional inductive 
reactance necessary for the circuit to oscillate. The equiva- 
lent resistance of the crystal at the operating frequency will 
affect the tuned circuit Q and hence the peak-to-peak drive 
to the modulator circuit. Smaller capacitors in series with the 
crystal (with corresponding changes in the inductor value) 
will push the operating frequency closer to anti-resonance 
and produce large equivalent resistances dropping the os- 
cillator drive level. Larger capacitance values cause the op- 
erating frequency to approach series resonance and a lower 
equivalent resistance (approaching Rg for the crystal, which 
is of the order of 402 to 10020 at 60 MHz). This can produce 
higher drive levels but risks operation at the lower over- 
tones. To prevent lower frequency oscillation a resistor can 
be connected across the crystal. Also a small resistor in 
series with one of the collector leads will form a low pass 
filter with the output capacitance and suppress spurious os- 
cillations at higher frequencies. If this is needed, resistor 
values less than 309 should be used, so that dc offsets will 
not prevent the oscillator from starting. For the circuit of 
Figure 8, capacitor values between 20 pF and 56 pF, with 
the appropriate inductor value, work well with only slightly 
reduced oscillator drive compared to the conventional L/C 
Circuit. 


TL/H/8452-8 
FIGURE 8. R.F. Crystal Oscillator Circuit 


~The Sound Carrier Oscillator 


Before moving to the R.F. output and the VSB requirements, 
-we need to look at another signal that will be added to the 


baseband video—the aural intercarrier. Both the LM1889 


-and the LM2889 have L-C sound carrier oscillators operat- 


ing at 4.5 MHz. Frequency modulation of the LM1889 sound 
oscillator is achieved by an external varactor diode which 
alters the tuning capacitance in response to the amplitude 
of the audio signal. The LM2889 has a similar tuned L-C 
oscillator but the frequency deviation is obtained by internal- 


_ ly phase shifting the oscillator current. This is done by a low 
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pass filter connected to the oscillator which provides a lag- 
ging phase voltage component of the oscillator waveform at 
the input to a differential amplifier. The current output from 


Sound Modulation 


this amplifier is controlled by the audio signal amplitude so 
that more or less of the current (now in quadrature to the 
original oscillator current) is added back to the tuned circuit 
producing the desired shift in the output frequency. Phase 
offsets of up to +12 degrees with increasing audio input 
levels will yield very low audio distortion (less than 0.2%). 
Also the use of a lagging oscillator waveform component 
reduces harmonic levels within the oscillator and a reduced 
possibility for undesired signals contaminating the R.F. 
waveform. 


The tuned circuit operating Q is important in two respects. 
Similar to the R.F. oscillator tuned circuits, the 4.5 MHz 
tuned circuit should have a high loaded Q for stability, but 
the circuit bandwidth must also be wide enough to accom- 
modate the FM sidebands produced by the audio modula- 
tion. For a maximum frequency deviation (Af) and maximum 
modulating frequency f, the minimum bandwidth is given by 
Equation (1). 
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B-W > Af (2.5 + 4f/Af) (1) 


The other requirement is that the maximum phase deviation 
of the oscillator current is able to produce the maximum 
frequency deviation (Af) of the carrier. This is given by 
Equation (2). 


Af = 4.5 x 106 x 0.12/Q (2) 
TL/H/8452-9 Table | summarizes the results of calculating the maximum 
FIGURE 9. LM2889 Sound Carrier FM Modulator circuit Q that satisfies Equations (1) and (2) for the various 
: monaural sound modulating standards used in the U.S. and 
Europe. | 
TABLE | 


Modulation 


She Bandwidth | Modulation | Deviation 
Mono 25 kHz 125 kHz <36 <21 
i eee 73 kHz 400 kHz <12 <7.4 
Pang eas mean acces 


i 


FIGURE 10. LM2889 4.5 MHz Sound Oscillator and Modulator 
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Sound Modulation (Continued) 


Clearly the deviation phase offset dominates the circuit Q 
requirement. | 


lf we choose a Q of around 10 then the oscillator drift with 
temperature (assuming a 2 degree phase change in oscilla- 
tor current with a 50 degree rise in temperature) is of the 
order of —9 kHz. A typical receiver will generate less than 
3% distortion at peak deviations with this much frequency 
drift but if better performance is required, then the circuit Q 
can be raised. High modulation linearity will still be retained 
with a Q of 20 and the oscillator maximum frequency drift 
will be halved. Alternatively temperature compensated tun- 
ing capacitors can be used (between N20 and N75). When 
higher circuit Q’s than 20 are employed, increased audio 
input levels will produce the desired peak frequency devia- 
tions but with the possibility of increased modulation distor- 
tion. The actual operating parameters that are selected can 
be balanced between distortion as a result of modulation, 
and distortion in the receiver circuits as a result of oscillator 
frequency drift. 


To ensure that we have a sufficient 4.5 MHz oscillator level 
to provide enough drive to the internal phase shift circuit, 
the load impedance at pin 13 should be greater than 3.5 kf. 
A second requirement is that we have enough oscillator lev- 
el to generate the desired aural carrier amplitude when 
modulated on the picture carrier. This means that load im- 
pedances greater than 6 kQ. are desireable. At 4.5 MHz, a 
typical oscillator coil of 23 wH will have an unloaded Q of 55 
and tune with 55 pF. For a working Q of 10, the external 
damping resistor is 7.5 kf. 


Stereo Sound 


The introduction in the U.S. of a multiplex stereo sound sys- 
tem (the BTSC system combining the Zenith MCS proposal 
with dbx noise reduction in the stereo difference channel) 
with peak carrier deviations in excess of 73 kHz puts even 


larger constraints on the tank circuit Q. Following the same _. 


rules as before, the maximum allowable Q for low distortion 
is now less than 7.4—with a loaded Q of 5 being likely. With 
this loaded Q, maintaining a carrier center frequency accu- 
racy better than 5 kHz with an L/C circuit becomes impracti- 


cal and other methods to set the oscillator frequency must 
be used. Since a crystal will provide the necessary tempera- 
ture and voltage stable reference frequency a PLL is a use- 
ful solution (see Figure 77). Either the widely available 3.58 
MHz crystals or a 4.5 MHz crystal can be used, but in either 
case, the L/C tank circuit frequency must be divided down 
before application to the phase detector. This is because 
frequency modulation of the sound carrier will produce 
many radians of phase deviation at the phase detector in- 
put—for a modulation frequency of 100 Hz and a peak devi- 
ation of 73 kHz the carrier phase change is given by Equa- 
tion 3. 


6 = Af/fm = 73 X 103/100 = 730 rads (3) 


Since the linear input range of most phase detectors is less 
than 2 7 radians, the modulated carrier input must be divid- 
ed down by at least 233 to keep the phase.deviation within 
this linear range. For a 4.5 MHz crystal, the reference fre- 
quency divider M and the sound oscillator divider N are the 
same. Available ripple counters such as the 74HC4040 and 
74HC4060 can easily divide by 128 (for monaural) or by 256 
for stereo. If a 3.58 MHz crystal is used the M:N divider ratio 
is 35:44 requiring substantially more packages, and the odd 
numbered divider must be followed by an even divide of 2 or 
4 to “square up” the input waveform to the phase detector. 
Also, since the video will include a chroma subcarrier, good 
isolation is needed to prevent the reference oscillator beat- 
ing with the chroma sidebands. 


A suitable phase detector is the 74C932 Exclusive- Or type 
with a sensitivity of 1.6 volts/radian. The filter at the detec- 
tor output prevents the input modulation from reaching the 
varactor diode and distorting the audio. Even so, the loop 
filter must have some ac bandwidth for a reasonable acaqui- 
sition time and other dynamic characteristics. The compo- 
nents shown in Figure 17 have been chosen such that with 


_ a varactor sensitivity of 100 kHz/volt the loop has a hold-in 
- range of over + 150 kHz, with a lock-up time of less than 0.5 


seconds. The T.H.D. is less than 1% for a 400 Hz modulat- 
ing frequency producing 25 kHz deviation of the carrier. The 
accuracy of the sound carrier frequency is, of course, that of 
the crystal used for the reference oscillator. 


TO PIN 13 
lo2 74HC4040 “ | 
lcg  74HC4060. — 


160K. Mv1404 


Vec e 
+5V 


15pF 


~T 


FROM PIN 13 i. 
LM2889 " 


FIGURE 11. 4.5 MHz Crystal Reference Circuit 
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Audio Processing For Sound 
Carrier Modulation 


With the proper tuned circuit Q (see Table |), a linear in- 
crease in the amplitude of the audio signal will produce a 
correspondingly linear increase in the frequency deviation. 
Television receiver sound circuits in the U.S. have a 75 ps 
de-emphasis and in Europe frequencies above 3.2 kHz 
(50 ws) are de-emphasized at a 6 dB/octave rate. This is 
done to help improve the S/N ratio of FM reception and the 
transmitter incorporates the complementary pre-emphasis 
characteristic—above 2.1 kHz the audio frequencies are 
boosted at a 6 dB/octave rate. The consequence of this 
modulation scheme is that if a 0 dB peak signal amplitude at 
15 kHz is capable of producing a 25 kHz deviation than a 
similar amplitude signal at 400 Hz will produce a peak devia- 
tion of only 3 KHz—a loss of some 18 dB in S/N ratio for the 
midband frequencies. Broadcasters usually employ com- 
pressors to enable high modulation levels to be obtained at 
mid-band frequencies without overmodulating high frequen- 
cies. If the audio input to the LM2889 is being sourced from 
an original broadcast (a scrambled signal decoder output for 
example) than this audio—without de-emphasis—can be di- 
rectly applied to pin 1 of the LM2889, and the overall input 
level is adjusted so that the modulation limits are not ex- 
ceeded except for brief intervals (less than 10 instances per 
minute). When the audio has not already been processed a 
different set of conditions will apply and an audio pre-em- 
phasis network is required at pin 1. 


15 
RELATIVE 
AMPLITUDE 

(AB) 19 


1 
T=RC= 91x 109 


TL/H/8452-—12 
FIGURE 12. Audio Pre-emphasis 


Since the audio source is likely to be at a relatively low 
impedance (a pre-amplifier output), the pre-emphasis net- 
work will also be used to attenuate the level of the average 
audio input to the LM2889 as well as providing a relative 
boost to the higher frequencies. The input sensitivity of the 
audio modulator is 150 Hz/mV which means. that 
118 mVrms will give a peak deviation of 25 kHz. 


Next we have to decide what signal frequency and ampli- 
tude to use in calibrating the audio input. Unfortunately the 
75 ps time constant for FM broadcasting was chosen at a 
time when equipment limitations meant there was relatively 
low spectral energy at higher frequencies. Today, modern 
audio material is not well suited to boosting above 2.1 kHz 
since energy peaks at only —6 dB can be obtained at 
10 kHz. A further complication is the ability of the audio level 
meter to predict high energy peaks. If a conventional VU 
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meter is used, peak levels of + 10 dB are possible while the 
meter is indicating OVU. Obviously without processing the 
audio to keep it within predetermined limits, the input level 
calibration will be somewhat empirical in nature. 


If we assume the decrease in spectral energy above 10 kHz 
is such that overmodulation peaks above this frequency are 
unlikely to occur, then we can allow a signal at 10 kHz to 
produce full modulation deviation. Since the amplitude of 
most audio signals at 10 kHz is at least 6 dB below the 
midband frequency level, we can calibrate the audio input 
with a —6 dB amplitude, 10 kHz tone to produce 100% 
deviation. As we shall see later, a frequency close to 10 kHz 
will make the measurement of actual peak deviations very 
easy indeed. With the standard pre-emphasis network, at 
signal frequencies less than 2 kHz, the modulating signal 
amplitude at pin 1 will be —8 dB below the anticipated peak 
10 kHz level producing 100% modulation. This corresponds 
to a modulator input level of 118/2.2 = 45.4 mVrms. The 
45MHz 


INCREASING MODULATION 


A F=25KHz 


APPROACHING CARRIER NULL 
TL/H/8452-13 
FIGURE 13. FM Spectrum with Increasing 
Audio Amplitude (f mod = 10.4 kHz) 4.5 MHz 
Sound Carrier Level 


cOP-NV 


AN-402 


Audio Processing For Sound 


Carrier Modulation (continued) 

input resistance at pin 1 is 1.5 kN so R1 = 30 kO, if we 
assume an input source level of 1 Vrms at 400 Hz. For a 2.1 
kHz breakpoint, C = 0.0027 pF. 

Anyone who has observed the output from an FM circuit 
with a spectrum analyzer will know that for a fixed modulat- 


_ ing frequency the output spectrum will consist of the carrier 


frequency component and sidebands spaced by the modu- 
lating frequency from the carrier. As the modulation ampli- 
tude is increased (the modulation index m becomes larger), 
the carrier decreases to a null and then increases again. 
The modulation indices for which carrier nulls occur can be 
calculated and for our purposes it is important to know that 
the first carrier null occurs at m = 2.4048. For a system 
maximum deviation of 25 kHz the moguialng roan ns fis 
given by: 


f = 25 < 103/2.4048 = 10.4 kHz (4) 


Therefore, if we use an input frequency of 10.4 kHz, as the 
input amplitude is increased the first carrier null will indicate 
peak deviation. If we continue with our assumption of a 
—6 dB level at 10 kHz, calibration consists of adjusting the 
audio input so that a —6 dB, 10.4 kHz signal causes the first 
carrier null. With the above pre-emphasis network, this 
should correspond to 500 mVrms at 10.4 kHz. 


We have already looked at the tuned circuit parameters at 
pin 13 in terms of deviation linearity and oscillator stability. 
With a working Q of 10, the effective load at pin 13 is 
6.2 kQ.. The oscillator current is.0.45 mA so that the output 
amplitude at 4.5 MHz is 3.6V (p-p). Some portion of this 
oscillator signal level is coupled over to pin 10 to set the 
sound carrier level and this can be done by splitting the 
external 7.5 kX. damping resistor into two parts. The picture 
carrier level is set by the offset voltage between pins 10 and 
11 as described earlier. For a 2V (p-p) video signal this off- 
set is 2.3V. Since the 4.5 MHz signal will be ac coupled over 
to the bias pin 10, it will amplitude modulate the picture RF 


carrier. This is conventional AM and a 4.6V (p-p) signal will 
yield sound carrier sidebands at —6 dB relative to the pic- 
ture carrier. If we require a sound carrier amplitude at 
— 17 dB, the signal coupled to pin 10 must be 11 dB below 
4.6V (p-p), or 1.3V (p-p). This is obtained by using a 4.7 kO. 
resistor coupled through a 0.1 F capacitor to pin 10, anda 
second 2.7 kf. resistor connected to the wiper arm of the 
potentiometer used to set the video modulation depth. The 
effect of the potentiometer setting on the aural carrier level 
is eliminated by a.0.1 wF capacitor connected from the wip- 
er arm to ground. However, since the impedance presented 
by the potentiometer will, for all practical purposes, be rela- 
tively constant, the capacitor could be removed and the par- 
allel resistance of the upper and lower arms of the potenti- 
ometer network used to provide the second resistor of 
2.7 kQ. If the video input level is well controlled, it may be 
possible to replace the potentiometer with a fixed divider. 


The final part of the design concerns the output stage, and 
involves meeting the constraints applied by any regulatory 
agency. In the U.S., apart from the need to restrict the peak 
carrier output level to less than 3 mVrms in 750, we have 
two signals present in the output whose level will exceed 
the spurious emission limit of —30 dB with respect to the 
peak carrier level. One of these signals is the result of am- 
plitude modulating the 4.5 MHz intercarrier audio on the pic- 
ture carrier. Apart from the desired —17 dB sound carrier 
amplitude (upper sideband) an equal amplitude lower side- 
band will be present. For channel 3 this is at a frequency of 
56.75 MHz—which is 250 kHz outside our channel lower 
limit. Therefore we need to provide at least 13 dB more 
attenuation at this frequency in the output filter. The second 
unwanted emission (or emissions) is the result of carrier fre- 
quency harmonics—specifically the 2nd harmonic level pro- 
duced by high modulator drive. To suppress this, from 
—18 dB to —30 dB attenuation at 123 MHz is required. 
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FIGURE 14. Audio Intercarrier Coupling to the Video Modulator R.F. Output and V.S.B. Filter 
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Carrier Modulation (continued) 


With a properly constituted baseband signal modulating the 
carrier, these are the only intrinsic unwanted emissions we 
are concerned with. Normal video modulation components 
appearing in the lower sideband will not have sufficient am- 
plitude and do not extend beyond the lower channel limit. 
Even so, the filter requirements are not trivial. 


lf L-C filters are used, this can be done with three coils per 
channel but some alignment procedure will be required. For- 
tunately SAW filters are available from several sources 
which, although more expensive than the equivalent L-C fil- 
ter, avoid the cost of production alignment. Usually the SAW 
filter will have a substantially greater insertion loss, but the 
LM2889 has enough output level to compensate for this. 
Both single channel and dual channel filters are available 
and in the latter case the LM2889 dual oscillator/modulator 
configuration enables easy dc switching between channels. 
A coil may be required, connected across the SAWF input, 
to tune out the SAWF input capacitance. 


The load resistors connected to pins 8 and 9 will set the 
LM2889 conversion gain, which for 750 is_ typically 
20 mVrms R.F. carrier per volt offset at the input pins 10 
and 11. The actual load will include the input resistance of 


12V 


LM 2889 


INDUCTORS yH 
CAPACITORS pF 


the filter. Since the output of the filter will normally be termi- 
nated in 752. to match the cable (and provide triple transit 
echo suppression for a SAWF), the best way to choose the 
load resistor is to monitor the output to the cable and apply 
a dc offset between pin 10 and 11 that is equivalent to the 
expected video input. The resistor is then chosen to give the 
desired peak carrier level of 2.5 mVrms. The carrier should 
be unmodulated since downward modulation will reduce the 
mean carrier level by as much as 2-3 dB. 


If the offset voltage between pin 10 and 11 is reduced, a 
check can be made on the residual carrier level at the out- 
put. This residual level is the result of oscillator feedthrough 
in the modulators and external coupling from the oscillator 
tuned circuits. The residual carrier level is normally better 
than —26 dB below the peak carrier level, ensuring good 
modulation linearity. High levels of residual carrier can be 
caused by coupling through ground or power supply leads. 
A good technique to minimize the effect of unwanted pick- 
up is to decouple the supply voltage to pin 8 and 9 load 
resistors over to the output connector shield ground. This 
removes at the output any carrier signal on the supply line to 
the load resistors. 


CHANNEL 3 FILTER 
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FIGURE 15. Vestigial Sideband Filters 
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FIGURE 16. R.F. Decoupling at the Output yee 
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FIGURE 17. Complete R.F. Modulator External Circuit 
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A 16-Bit Video Shift 
Register with On-Board 
FIFO Operates at Rates Up 
to 350 Million Pixels Per 
Second 


INTRODUCTION 


The evolution of graphics systems has, in part, been driven 
by the requirements for higher system resolutions. High per- 
formance raster scan graphics systems often require a res- 
olution that exceeds 2k x 2k pixels. For non-interlaced ras- 
ter scan monitors with a vertical refresh rate of sixty frames 
per second, the above screen size translates to a pixel rate 
of 300 MHz. Many systems being designed today require 
pixel rates significantly exceeding 100 MHz. Although this 
high data rate is only present over a small portion of the 
graphics subsystem, implementation requires significant 
modification to the architecture and hardware of the graph- 
ics subsystem. 


SYSTEM OVERVIEW 


Figure 7 shows a simplified block diagram of a graphics 
subsystem. The system processor interacts with the graph- 
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ics processor via the address and data buses and transfers 
the data to the display memory. The graphics processor 
controls the graphics subsystem; its functions include imple- 
menting the drawing algorithms, providing the CRT format 
(sync, blank, and size), and generating the addresses for 
the display memory. A BITBLT processing unit, such as Na- 
tional Semiconductor’s DP8511 implements all of the classi- 
cal BITBLT (Bit boundary Block Transfer) functions. Under 
the control of the graphics processor, it performs all the 
data path operations. The video shift register accepts the 
data from memory (typically dynamic RAM or video RAM) in 
parallel form and converts it to high speed serial data for the 
video DAC. The timing for this subsystem is provided for by 
a high performance video clock generator such as Nation- 
al’s DP8512. 
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To meet the requirement of a pixel rate exceeding.100 MHz, 
graphics designers have to rely upon ECL logic circuitry that 
is costly, in PC board space, power consumption, and com- 
ponent cost, even though the remainder of the system oper- 
ates at much lower data rates. Since it is unnecessary and 
economically impractical to build the entire graphics system 
in ECL logic, conversion circuitry is required to translate TTL 
logic levels to ECL logic levels. Furthermore, since most off- 
the-shelf ECL circuits and TTL to ECL translators operate 
with a negative supply voltage, an extra, negative power 
supply is required. ; 

Design of multiple plane/board systems becomes especial- 
ly difficult when high pixel rates are employed. The high 
pixel rates translate to high shift register parallel load rates. 
Looking at Figure 7, in multiple board systems, proper timing 
between data from memory (shift register parallel input 
data) and the load clock is difficult to maintain especially 
since data is arriving at the shift register from various boards 
while the clock (DP8512) is originating from one source. 
Even if variable delays are employed for adjusting the timing 
of the data arriving at the shift register, the total delay can- 
not exceed one load clock period without employing a FIFO. 
Again, looking at Figure 7, from the appearance of a clock 
signal on the bus, data from memory on one of the planes 
will not arrive at the shift register until two receiver delays, 
one RAM access time, and one driver delay later; at high 
data rates this could easily exceed one load period. 


DP8515/16 EASES SYSTEM DESIGN 


National Semiconductor’s DP8515/DP8516 was designed 
with several key goals in mind. Most important was the goal 
to ease the system designer’s job of designing the parallel 
to serial conversion section of the graphics subsystem. The 
second goal was to reduce the cost, power consumption, 
and board space currently required for the parallel to serial 
conversion in mid- to high-performance graphics systems. 
The final goal was to develop a flexible part, one that could 
easily be configured to fit well in any graphics system. 


6 WR INPUT 


The DP8515/DP8516 Video Shift Register integrates the 
functions of a 16-bit 350 MHz shift register, ail the TTL to 
ECL translators, and a 4-word deep FIFO. The integrated 
circuit is designed to operate with a standard TTL supply 
and optionally either a positive or negative ECL supply. The 
DP8515 meets both MECL 10K and F10K specifications 
while the DP8516 is both MECL 100K and F100K compati- 
ble. Interfacing the DP8515/DP8516 with a video DAC is 
made easy with the differential outputs SO/SO and S$8/S8. 
With both the last bit and the 8th bit shift register outputs 
available, the integrated circuit can be used as two shift 
registers in an 8-bit system, thus cutting the component 
count in half. As will be shown later, the two outputs result in 
a lower component count for various word length systems. 


System expansion to 32-bit or other word lengths is possible 
with the DP8515/DP8516 by using the shift register serial 
inputs, SI/SI. All of the ECL inputs are differential in order to 
improve noise immunity; however, for single-ended use, one 
of the differential inputs can be wired to an on-chip refer- 
ence, Vpp, that is compatible with standard ECL levels. 
The maximum pixel rate is guaranteed to be a minimum of 
350 MHz. This is achieved with a relatively low supply cur- 
rent of 130 mA through the use of National Semiconductor’s 
combination Bipolar/CMOS process. 


FIFO EASES TIMING PROBLEMS | 


_ As mentioned before, in mid- to high-end graphics systems 


DATA INPUTS 


the pixel rate may range from 100 MHz to over 300 MHz. 
The timing problems discussed above have been addressed 
by the DP8515/DP8516. The DP8515/DP8516 contains an 
on-board four word by sixteen bits FIFO. This FIFO is 
placed after the input latches and in front of the shift regis- 
ter as is shown in Figure 2. By setting two input pins, MO and 
M1, the part may be configured in three separate ways: the 
FIFO in front; the FIFO bypassed with the data inputs con- 
figured as transparent latches; or the data inputs configured 
as edge-triggered flip-flops. 
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FIGURE 2. DP8515 Video Shift Register Block Diagram 
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With the inclusion of the four word FIFO, timing problems 
between the data from memory and the shift register load 
clock have been eliminated. Figure 3 shows a multiple plane 
graphics subsystem. The timing for this system is being pro- 
vided by two of National Semiconductor's video clock gen- 
erators, the DP8513 (multiple board version of the DP8512), 
and the DP8514. As mentioned above, data will appear at 
the video shift register inputs significantly delayed from the 
arrival of a clock signal, LCLK1, on the bus. This delay is 
comprised of the LCLK1 receiver delay (on PO for example), 
the video RAM access time, the driver delay for the video 
RAM data going onto the bus, and the receiver delay for the 
video RAM data coming onto the board containing the video 
shift register. In high pixel rate systems, this delay may easi- 
ly exceed the period of the data being input into the video 
shift register. A setup or hold time violation could then occur 
which may cause data to be lost or appear at the wrong time 
on the screen. 


This timing problem is easily solved with the FIFO on the 
DP8515/16 Video Shift Register. First, by running the video 
shift register write clock to the various boards containing the 
video RAMs, any potential skew between the shift register 
input data and its write clock is eliminated. The fact that the 
total delay in the data path may exceed one data period is 
no longer a concern since the FIFO will act as a buffer. 


With a data rate of 14 MHz (71 ns period) and a total delay 
of 100 ns, the system designer would have severe problems 
without a FIFO. Using a delay line, he might try to delay the 
shift register parallel load by the same amount as the data, 
100 ns, however the parallel load needs to maintain a well- 
defined relationship with the pixel clock. For a pixel rate of 
224 MHz (14 MHz data rate and 16 bit word), a setup time of 
about 2 ns is necessary for the parallel load. it is very diffi- 


DATA 


cult to maintain this setup time while delaying the parallel 
load by 100 ns. Another possible solution might be to delay 
enabling the shift register parallel load by two data periods 
since the data is delayed by approximately one and one-half 
periods. Since no delay is inserted in the shift register paral- 
lel load path, the setup time to the pixel clock is maintained. 
The problem with this approach, also present in the first 
approach, is that the data path delay must be known over 
temperature, supply, and part-to-part variations. If the data 
delay of one and one-half periods increases to more than 
two periods or reduces to less than one period the data is 
lost since a shift register parallel load will not occur during 
that period. 


When using a FIFO, this problem does not exist. The en- 
abling of the shift register parallel load can be delayed by up 
to four data periods. As long as the data delay does not 
exceed four periods, no data is lost since data will be written 
into the FIFO as soon as data and write clock reach the 
FIFO. Any data delay variations can be accommodated by 


‘the FIFO. 


Figure 4 shows a possible timing scheme for the Video 
Clock, Video RAM, and Video Shift Register interface. As is 
shown in the figure, time period 2 is used to write the data 
from the Video RAM into the Video Shift Register and this is 
the time which will vary depending on the delays. Whether 
the delay in the data path is less than one data period or 
greater, there is no problem since data will not be read from 
the FIFO into the ECL shift register until time period 3. It is 
of no concern to. the ECL shift register whether one, two or 
three WR clocks, along with data, occur before the Parallel 
Load, PL, because this will just vary the number of words 
written into the FIFO. Although the FIFO is absolutely es- 
sential for high speed systems, it will significantly ease de- 
sign in any system. 
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FIGURE 3. Multiboard Graphics Interface 
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FIGURE 4. VCG/VSR/VIDEO RAM Timing Diagram 


SCROLLING SIMPLIFIED WITH CHIP 


Two pins on the DP8515/16, OUTPUT CONTROL and 
OUTPUT LEVEL CONTROL, allow the user to manipulate 
the shift register data if he so chooses. With the OUTPUT 
CONTROL input at a high level, the last bits of the shift 
register, that is the SERIAL OUTPUT, SO, and the S8 bit, 
are prohibited from shifting while all of the other bits are 
being shifted by the pixel clock. This has the effect of per- 
forming a basic scrolling function on the screen. The OUT- 
PUT LEVEL CONTROL input determines to what level the 
last bit is set. Even though the eighth bit is held at the out- 
put, internally information will continue to be shifted through 
this bit when using the OUTPUT CONTROL function. 


When doing a horizontal scroll, the user wants to be able to 
shift the data appearing at the beginning of the screen by a 
certain number of bits. Ideally the data can be shifted in 
single-bit increments thus giving the perception of smooth 
scrolling rather than the “jumping” data which results from 
scrolling in word length increments. Since data is read out of 
memory in whole words, the user cannot perform a shift less 
than a word length, by changing the data read out of memo- 
ry. He must manipulate the data coming out of the shift 
register which is costly in terms of component count and 
board space, or he must rewrite the data in the buffer mem- 
ory which is costly in terms of required time to perform the 
operation. 


Figure 5 shows how DP8515/16s can be used to perform 
scrolling. The DP8512 video clock generator supplies a vid- 
eo enable signal at the beginning of each screen line and a 
video shift register parallel load signal, LCLK, for each data 


word. In the circuit shown in Figure 5, the video enable sig- 
nal is slightly delayed with respect to the LCLK signal. Thus, 
the output of the D flip-flop being input to the OUTPUT 
CONTROL input of the video shift registers will occur up to 
one word after the. shift register parallel load signal for these 
registers. The LCLK signal from the DP8512 is applied to 
the parallel load input of another DP8515/16 the “Scroll 
Control Register” in Figure 5. This shift register acts as a 
delay for the parallel load signal being sent to the other 
DP8515/16s. The value of the delay is programmed by the 
information loaded into the parallel inputs of the Scroll Con- 
trol Register. The LCLK signal is replicated by this register 
by inputing the appropriate ones-zeroes pattern into the par- 
allel inputs. The delay is achieved by properly positioning 
the beginning of the ones pattern within the input stream. 
Since the video shift register has an on-board four word 
FIFO, this FIFO may be used to replicate the delayed ver- 
sion of the LCLK signal. Also, as a result of the FIFO, the 
input pattern needs to be loaded only once. After the infor- 
mation is in the FIFO it will be cycled over and over. 

Figure 6 is a timing diagram for the circuit in Figure 5. At 
time T1, data is. loaded into the video shift registers and is 
shifted on the subsequent pixel clocks. At this time, howev- 


- er, no data is output from the shift registers to the DACs 
_ Since the delayed video enable signal (OUTPUT CONTROL) 
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is high. At time T2 the delay video enable signal (OUTPUT 
CONTROL input) goes low and data is output from the video 
shift registers. The number of pixel clocks occurring be- 
tween T1 and T2 determine by how many bits the data is 
scrolied. | 7. * 
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SHIFT REGISTER TAP REDUCES COMPONENT COUNT 


The ECL shift.register inputs and outputs have been de- 
signed to accommodate cascading and for ease of intercon- 
nection with DACs. The inputs and outputs may be used 
differentially or single-ended. If single-ended use is desired, 
the on-board Vgp reference is provided for the unused in- 
puts. Differential inputs and outputs significantly ease board 
design by reducing the circuit’s susceptibility to noise picked 
up while routing the traces on the printed circuit board. 


The tap at the eighth bit enables the user to ‘‘customize” 
the part for various word lengths. For an eight bit system the 
component count, and thus power dissipation, is cut in half 
by the availability of the eighth bit; one part may be used for 
two words. Figure 7 shows how the tap results in compo- 
nent count reduction for other word lengths. In this example 
a twenty-four bit word is used. Three video shift registers 
can perform the parallel to serial conversion for two words. 
If the tap was not present, another video shift register would 
be necessary. This results in a savings of four video shift 
registers (12 versus 16) in an eight plane system. This flexi- 
bility reduces the total cost of components, board space, 
and power. 


ADDITIONAL FEATURES ENHANCE FLEXIBILITY 


Two additional features provide the system designer with 
even more control over the operation of the DP8515/16. A 
HOLD input is provided which, when held high, inhibits the 
shifting of all of the bits of the shift register. This is in con- 
trast to the OUTPUT CONTROL input which only inhibits the 
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2 WR 


CLK 
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shifting of the SO and S8 bits. The HOLD input is ECL com- 
patible and may be applied for as short a time as one pixel 
clock. . 

An ENABLE input is also provided on the DP8515/16. 
When held high, this TTL compatible signal will inhibit the 
parallel loading of the data into the shift register. 


ECL POWER SUPPLIES NO LONGER A PROBLEM 


~ As mentioned earlier, the DP8515/16 was designed with 
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several goals in mind; one of these was to develop a flexible 
part that could easily be configured to fit well in any graphics 
system. The DP8515/16 power supplies are specified to 
meet this goal. The DP8515/16 has separate pins for the 
TTL supply and the ECL supply. However, the ECL supply is 
specified so that it may be shared with the TTL supply. Both 
the 10K (DP8515) and 100K (DP8516) versions have all 
specifications guaranteed with supplies varying from 4.2V to 
5.5V. Also the ECL supply may be either positive or nega- 
tive. This allows the user to configure this part around his 
system, not the other way around. Also, the on-chip Veg 
reference eliminates the problem of having to generate dif- 
ferent references for positive and negative supplies if single- 
ended signals are being used. | 


DP8515/16 IDEAL FOR MANY APPLICATIONS IN 
ADDITION TO GRAPHICS 


The DP8515/16 video shift register is well-suited for any 
high-speed application where parallel to serial conversion is 
required. Any high-speed system will have the timing prob- 
lems described above which can be eliminated by placing a 
FIFO in front of the shift register. 
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FIGURE 7. DP8515/DP8516 Interconnection for 24-Bit Word 
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The DP8515/16 is ideal for the generation of any high- 
speed pattern for ECL circuitry. The pattern can be parallel- 
loaded at a low rate into the FIFO. The serial pattern can 
then be shifted out at sixteen times this rate and at ECL 
levels. If the DP8515/16’s are cascaded, higher differentials 
between input and output data rates are possible. A pattern 
of up to sixty-four bits can be stored in the FIFO and, by 
cycling through the FIFO, continuously shifted out of the 
shift register in serial form. If longer patterns are desired, 
the FIFO is periodically updated. The serial data from the 
shift register can be clocked by a synchronous clock source 
which was divided down to generate the write clock for the 
FIFO. 


3-59 


SUMMARY 


The DP8515/16 video shift register has been designed with 
the intent of significantly easing graphics subsystem design 
and at the same time reducing the cost of the paraliel-to-se- 
rial data conversion section of the graphics sybsystem. The 
chip can be configured to meet the needs of the system. A 
four-word FIFO is available and when put in front of the high 
speed shift register, eliminates the serious data timing prob- 
lem which arises when designing multiple board systems. 
This video shift register may operate off of positive or nega- 
tive power supplies, is available in 100K and 10K options, 
and includes features which allow smooth panning and pixel 
replication zoom functions. Also a tap at the eighth bit al- 
lows the chip to operate efficiently with any word length. 
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Mid-Scan-Line Load 
Techniques. ; 
Using the DP8500 Raster 
Graphics Processor | 


A video graphics system’s main task is to perform drawing 
operations with the highest possible performance. In order 
to achieve high drawing rates the drawing processor must 
have access to the frame buffer via the system bus with as 
little interruption as possible from other system activities. 
Unfortunately, all video systems are required to perform 
some level of display refresh which, unless the frame buffer 
is dual-ported, will require the refresh controller to fetch 
data from the frame buffer utilizing the same bus that the 
drawing processor is using. As a result, the drawing process 
will be interrupted in order to perform the display refresh 
operations which will ultimately hinder the drawing perform- 
ance of the system. 


VDRAM TECHNOLOGY 


The bus bandwidth overhead associated with display re- 
fresh has been greatly reduced with the advent of the dual- 
ported Video Dynamic RAM (VDRAM). Shift registers tied to 
a separate video data port have been incorporated into con- 
ventional DRAM devices providing a convenient way to 
transfer frame buffer data to the video output section. Data 
is transferred to the shift registers a row at a time and then 
clocked out of the serial port as needed by the video refresh 
section. The activity of transferring data from the random 
memory array to the shift registers is commonly referred to 
as a data transfer cycle. 


Utilizing VDRAM devices in a RGP-based graphics system 
inherently increases the system bus bandwidth for other bus 
activities not related to display refresh. The time ordinarily 
needed for fetching display data, particularly in a system 
with a frame buffer consisting of conventional DRAM devic- 
eS, Can occupy greater than 50% of the available bus band- 
width. In contrast, a frame buffer consisting of VDRAM de- 
vices in a RGP-based system are accessed for display up- 
date at the most once every 4096 pixels displayed, so the 
system bus is freed up for other system activities, such as 
drawing. 

Display refresh using VDRAM devices is accomplished by 
performing a read cycle on the random access port in the 
data transfer mode (DT/OE before RAS), which enables the 
transfer of a row of frame buffer data to its internal 256-bit 
shift registers. Once the read operation has completed, the 
video data is clocked out of the serial access port with the 


| Shift Clock (SC) input on the VDRAM. The advantage of this 


dual-port arrangement is that the system bus is utilized only 


_ once for the transfer cycle and then the shift registers take 


over delivering the frame buffer data to the display. 


The timing requirements of VDRAMs with respect to the 
data transfer cycle, however, are rather rigid. The rising 
edge of the DT/OE input, which controls when the data is 
loaded into the internal shift registers must occur at a point 
in time when the last bit of data is clocked out of the serial 
port but before the next rising edge of Shift Clock. At video 
rates, this can become difficult to control. 
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SIMPLIFYING THE DISPLAY REFRESH TIMING 


Because of the precise timing requirement required for dis- 
play refresh, it is not uncommon to feed the outputs of multi- 
ple VDRAM devices to a high speed parallel-to-serial shift 
register so that a slower clock can be used to clock to 
VDRAM shift register. This essentially relaxes the window 
for performing the transfer cycle as well as extending the 
time interval between transfer cycles. 


In some systems this solution already exists because of the 
word width imposed by the drawing processor. For example, 
an RGP-based system utilizes a 16-bit data bus which im- 
plies that four VDRAM devices (i.e., 64k x 4 devices) be 
used to form the required data width on the random access 
port. The serial port word width matches the random access 
word width of 16 bits. The period (in pixels) between data 
transfer cycles can be calculated by multiplying the number 
of bits used to create the data word, which, in the RGP’s 
case is sixteen, by the length of the VDRAM’s serial shift 
registers. Using the RGP-based system as an example the 
period can be calculated as 16 < 256 = 4096 bits. 


This effectively creates a longer serial word per transfer cy- 
cle performed, which results in extending the maximum peri- 
od between data transfer cycles to be equal to some multi- 
ple of the horizontal line length. With some display widths, 
for example, a 1024-bit wide format, this would allow the 
data transfer cycle to occur during the blanking period, elim- 
inating the need for generating the precise timing that other- 
wise would be required if performing the data transfer cycle 
during active video. This method provides a reasonable so- 
lution for some system designs which can match the display 
format to the frame buffer’s effective width or some multiple 
of it. 


SIMPLICITY WITH COMPROMISE 


A problem with the above approach is that not all display’s 
horizontal dimensions can conform to the implied bounda- 
ries of the frame buffer. A good case in point is the 1280 x 
1024 display. Given a 16-bit wide RGP data bus and using 
64k by 4 VDRAM devices, a simple frame buffer could be 


-implemented using two banks of 4 devices configured as 


illustrated in Figure 7. Each VDRAM is responsible for pro- 
viding 4 bits of video data which is fed to a 16-bit shift regis- 
ter. This configuration would allow for a data transfer cycle 
to occur at a maximum of once every 4096 pixels displayed 
(16 <x 256 = 4096). 


In order to guarantee that the display refresh cycle does not 
occur during active video, a value for the word offset be- 
tween vertically adjacent pixels in the frame buffer called 
the display buffer warp, must be chosen so that the shift 
registers never become empty before reaching the end of 
the scanline. 
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FIGURE 1. 2k x 1k x 16-Bit Frame Buffer 
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When the RGP generates a display refresh cycle at the start 
of each scanline, the address generated by the RGP is cal- 
culated by adding the display buffer warp value to the dis- 
play buffer starting address each time a new scaniine is 
started. This requires that the display buffer warp be set so 
that successive scanline’s starting addresses can be divid- 
ed into the shift register’s effective length evenly. In this 
case, the display buffer warp is calculated as being 1/¢th (to 
convert to words) the value closest to the horizontal line 
length of 1280 which can be evenly divided into 4096, which 
is 2048. In this case, each time a new scanline is started, 
the new refresh address would be the previous line’s start- 
ing address plus 128. (2048/16). In this way the VDRAM’s 
shift register, which is really 4096 bits long, would never be 
exhausted during active video. | 


An apparent disadvantage of using this type of refresh con- | 
figuration is that it wastes a large percentage of the frame | 


buffer. The memory between pixels 1280 and 2048 on each 


‘scanline cannot be used in a linear fashion because it is in — 


fragments throughout the frame buffer..In this example, a 
total of 48k words of random access memory which could 


be utilized for other system tasks would be forfeited to sim- - 


plify the data transfer logic timing. 


A more efficient design would therefore allow the display 
buffer warp to be set such that the ending address of each 
scanline is adjacent to the beginning address of the next 
scanline in the frame buffer, eliminating the wasted buffer 


area. As a tradeoff the system would be required to perform — 


a new data transfer cycle on demand, regardless of being in 


active video or not, when the VDRAM shift register’s con- . 


tents are about to become exhausted. This operation is 
called a Mid-Scan-Line Load Operation. 


A BETTER SOLUTION USING THE RGP 


The DP8500’s refresh logic was designed with VDRAM — 


technology in mind and provides an easy way to implement 
mid-scan-line load with a minimum of external components. 


The Video Refresh Control (VRC) block of the RGP consists 
of a refresh address counter which is clocked by the LCK 
input, and the timing chains that ultimately produce the sync 
and display refresh request signals. When the refresh coun- 
ter is either clocked to 255 or when the end of a scanline is 
reached, the VRC block generates a signal, DRREQ, which 
indicates that the RGP will perform a display refresh cycle 
when the current bus cycle completes. This signal can be 


used to flag the system memory controller which can gener- 


ate the appropriate data transfer cycle timing in the nen bus 
cycle. 

One LCK cycle after the RGP’s internal refresh counter 
clocks to 255, the RGP will internally request a display re- 
fresh cycle. If there is a bus cycle currently in progress, the 
RGP will continue executing the current cycle, regardless of 


its length, until the WAIT input is sampled high. At this point, . 


the RGP will initiate a display refresh cycle and allow the 
system to perform the VDRAM data transfer. cycle. 

The criteria for performing a successful data transfer cycle 
during active video is to start the cycle as normal by subse- 
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quently asserting DT/OE, RAS and CAS low and then posi- 
tion the rising edge of the DT/OE signal so that it transitions 
after the last word of serial port data is transferred to the 16- 
bit output shift register but before the next rising edge of SC 
(shift clock). This will ensure that the last word of video data 
is not corrupted before being transferred to the 16-bit output 
shift register. 

Because the period between the DRREQ transition and 
when the address counter rolls over is fixed at 1 LCK cycle, 
a problem may arise if the display refresh request from the 
RGP comes during a bus cycle which is longer than the 
period of time that is needed to shift out the remaining 16-bit 
word of display data. There is a possibility that the data 
transfer cycle would occur too late, resulting in an interrup- 
tion in the process of delivering video data to the display. 


This problem can be overcome by delaying the blanking 
signal by ‘‘n” LCK periods which, in effect, allows the RGP’s 
internal refresh counter to start clocking ‘‘n’” LCK periods 
before active video. This causes the RGP to assert the 
DRREQ signal ‘“‘n” LCK periods before the VDRAM shift 
register’s contents are exhausted allowing the RGP ample 
time to complete the current bus cycle and start the display 
refresh cycle. The number of LCK periods to delay the 
blanking by can.be determined by summing up the time re- 
quired to execute the longest bus cycle and the period with- 
in the display refresh cycle just prior to when the DT/OE is 
to be clocked high and dividing that total by the LCK period. 


The period of time between when the RGP asserts DRREQ 
and when the VDRAM shift registers are empty is deter- 
mined by the “‘n” LCK delay of the BLANK signal plus the 
time it takes to perform the display refresh cycle. It is impor- 
tant to note that the timing relationship of DRREQ and 
DT/OE must remain constant. The bus state machine must 
guarantee that this period is maintained regardless of the 
length of the current bus cycle. This can be done by using 
the DRREQ signal to enable a down counter being clocked 
by LCK. The count length is set to match the delay of the 
BLANK signal. The counter’s output is sampled by the bus 
state machine to hold off the completion of the display re- 
fresh cycle by inserting wait states to the RGP during the 
display refresh cycle. The DT/OE signal is set low but is not 
released high again until the counter reaches zero. At this 
point, the bus state machine asserts both DT/OE and WAIT 
high and finishes the display refresh cycle. In this way, the 
period between DRREQ and DT/OE transitions remains 
constant, guaranteeing the transfer of data to the shift regis- 
ter at exactly the right time. 


IMPLEMENTATION 


The logic for the above circuit was implemented in a single 
National 16V8 Generic Array Logic (GAL®) device. The log- 
ic diagram, timing, and GAL equations can be found in Fig- 
ures 2, 3, and 4 respectively. The circuit was tested on Na- 
tional’s DP850EB 4-Plane Evaluation Board. Please consult 
the DP850EB 4-Plane Evaluation Board System User's 
Guide for further information on designing a graphics sys- 
tem around the DP8500 Raster Graphics Processor. 
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module ms14 
title "Mid_scan_line_load logic 


Gary Betz National Semiconductor Corporation, Santa Clara, Ca. 


June 8, 1988 


This device delays blanking by two clock periods. It also contains a 
2-bit counter which is used to hold off finishing the display refresh 
cycle. Please see the timing and schematic diagrams for more details 


on how this circuit is used. 


Icl device 'pl6v8r's . 
| clk | pin 
LE . pin 

I2 pin | 
reset~ pin 
drireq~ pin 
blank ~ pin 
I6 pin 
I7 pin 
latched.drreq~ — pin 
GND . pin 
OE pin 
end_dt_cycle ~ pin 
dr_req~ pin 
a pin 
b 3 pin 
F4 . pin 
F5 pin 
delayed_blank~ pin 
rst_req~ pin 
vec pin 
equations ‘end_dt_cycle = 

fa s= 
oe — 


fdr_req = dr_req~ ; 


oOo ON DOA FWD & 


wo onl oO we we we we we we we we we 


we we we 


PrP eH 


"LCK input. 


"Reset in from RGP. 
"Inverted dr.req~ for flip/flop. 
"Blanking input. . 


"Starts the down counter. 


"Used to Signal state machine to 
"finish the display refresh cycle. 


"Dr.req~ input from RGP. 


"Q—bit counter LSB. 
"2=<bit counter MSB. 


"Blanking delayed by 2 LCKs. 


"Clears the latched dr.ireq~ 


(ja & !b) # !delayed_blank ~ ; 


(a & tlatched_drregq~ ) 
# (I2 & latched_drreq~ ) ; 


(b & 


la & !latched_drreg~ ) 


# (!b & a & !latched_drreg~ ) 
# (!I2 and latched_drreq~ ) ; 


Irst_req~ s= ‘!end_dt_cycle # ‘!reset~; 
ldelayed_blank~ := F5; 


1F5 s= blank~ ; 


end 


FIGURE 4. GAL Equations 
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Accurate Timing for Multi- 
Board Graphics Systems 


Two members of National Semiconductor’s Advanced 
Graphics Chip Set are the DP8513, Multiboard Video Clock 
Generator, and the DP8514, Crystal Clock Generator. 
These devices are used in multiple board/plane graphics 
systems to generate and synchronize the clocks which drive 
the graphics processors and transfer VRAM information out 
to DACs for CRT display updates. Synchronizing the signals 
involved in these operations is becoming increasingly diffi- 
cult as display resolutions increase and as more color 
planes are offered. The higher display resolutions increase 
the rate which information must be transferred from memory 
to the display. The increasing number of color planes force 
VRAM information to reside off of the mother board thereby 
introducing timing constraints associated with back plane 
data transfers. Utilization of the DP8513 and DP8514 great- 
ly simplify these clock generation and synchronization prob- 
lems. 


Updating a CRT display invoives a parallel to serial conver- 
sion of screen information from VRAM memory into a video 
shift register which then serially feeds a DAC. Pixel and load 
clocks are used to coordinate the transfer of the VRAM data 
into the video shift register and also the shifting of the data 
out to the DAC. Present day screen resolutions of 1280 x 
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SYNCHRONIZER/ 
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CLOCK 
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1024 pixels and beyond are becoming commonplace. This 
translates to a pixel rate exceeding 100 MHz for the con- 
ventional non-interlaced raster scan monitor. Manipulation 
of the screen information at these high data rates requires 
that the video shift register and DAC portions of the graph- 


ics system be implemented in high speed Emitter Coupled | 


Logic (ECL). The ECL pixel and load clocks associated with 
these data transfers must syncronize with the slower MOS 
clocks driving video memory and the graphics processors 
so that screen updating, blanking, and scrolling operations 
are properly carried out. The DP8513 and DP8514 generate 
these pixel, load, and processor clocks and provide the ap- 
propriate gating circuitry to simplify interconnection into eéi- 
ther MOS, TTL or ECL based systems. 

The DP8513 generates the pixel, load, and processor 
clocks using a digital phase lock loop as shown in Figure 7. 
The device takes an input reference signal provided by the 
DP8514 and uses it to precisely control a high speed volt- 
age controlled oscillator from which the pixel, load, and 
processor clocks are derived. The phase lock loop aittri- 
butes enable the input reference signal and all the output 
clocks it generates to inherently be phase aligned. 


REF IN 


(FROM DATA OUT) 


REFCLOCK 
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LOAD CLOCK 
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FIGURE 1. DP8513 Functional Diagram 
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The DP8514 is comprised of two independent functions as 
shown in Figure 2: an oscillator with TTL outputs and a syn- 


chronizer/MOS clock driver. The oscillator function provid- 
ed by the DP8514 need only be implemented once in any 
system on any arbitrary board. The oscillator and oscillator 
divided by two outputs, X OUT and DATA OUT, are intena- 
ed to be connected out to the backplane to drive all boards 
identically. (In a system where it is desired to lock to an 
external video source, this oscillator function should reside 
on the motherboard. In this configuration, the DP8513’s PLL 
function can interact with the DP8514’s crystal/ceramic os- 
cillator to phase align itself to an external horizontal input 
signal.) 


X OUT 
2 


DATA OUT. 
(  ) 


REF IN 


MOS 2 PHASE (FROM DATA OUT) 


CLOCK DRIVERS | 


REFCLOCK 


FROM X OUT 
SYNCHRONIZER/MOS DRIVER | ( 


FIGURE 2. DP8514 Functional Diagram 


The DP8514’s_ synchronizer/MOS clock driver function 
would be required on every memory board in a system. The 
synchronizer function receives the DP8514 reference sig- 
nals, X OUT and DATA OUT, off of the backplane and pro- 
vides controlled duty cycle 2 phase MOS clock drive for the 
DP8510 or DP8511 Bit BLT Processing Units associated 
with each color plane. The DP8513 incorporates a synchro- 
nizer/MOS clock driver function identical to the DP8514’s 
which is used to drive the Raster Graphics Processor. This 
configuration enables the RGP to maintain phase alignment 
with each of the remotely located Bit BLT units it com- 
mands. 


Increasingly higher screen resolutions require higher pixel 
and video shift register load clock rates which in turn push 
the VDRAMs to be operated near their maximum specified 
frequency. In a multiple board system the video shift regis- 


ters and DAC’s must be contained on the mother board | 


while their VDRAM data is received through a backplane 
from another board. This configuration is necessary be- 
cause the ECL signals, which the video shift registers and 
DACs required, aren’t suited for distribution out through the 
backplane. Clocking VDRAMs from a source on the mother- 
board and receiving data back from them through the back- 
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plane presents some significant timing problems. The back- 
plane introduces clock waveform distortions and driver/re- 
ceiver delays which limit data throughput. The total delay 
associated with the transfer of VDRAM data to the video 
shift register must not exceed one load period. This opera- 
tion involves clock and data signals propagating through 
two receivers, one VDRAM access time, and one driver. As 
the load clock cycies approach 40 ns, satisfying the timing 
requirements becomes a significant challenge. 


The DP8513 and DP8514 address these multiple board tim- 
ing problems associated with the backplane. The DP8513 
provides several different gated load clock outputs which 
help to eliminate the one load clock transfer constraint. The 
DP8514 contributes by providing a means of reducing the 
reference clock phase skews between boards and eliminat- 
ing concerns about the clock waveform distortions intro- 
duced by the backplane. . 


Figures 3 and 4 shows how the DP8513 works in conjunc- 
tion with the DP8515 video shift register which has an on- 
board FIFO to simplify the load clock timing. The sequence 
begins with the processor issuing a video unblank signal to 
the DP8513’s ENIN1 input. This begins a train of load clock 
pulses out of the DP8513’s LCLK1 output synchronous with 
the next load clock transition. These pulses will travel down 
the backplane and result in data being strobed out of the 
VRAM’s serial port. The DP8513’s ENOUT1 pin to ENIN2 
pin connection results in a train of load clock pulses © 


- (LCLK2) beginning one load clock cycle after LCLK1 which 
~ also travels down the same backplane path. This LCLK2 
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train of pulses will be routed back to the motherboard along 
the same path as the VDRAM’s output data. The LCLK2 
train of pulses thus strobe the VDRAM data into the video 
shift register’s FIFO port with a setup time dependent only 


‘on the VDRAM access specification. The connection of 


ENOUT2 to ENIN3 on the DP8513 provides a train of load 
clock pulses (LCLCK3) which lags the LCLK2 output pulse 
stream by 4 load clock cycles. The LCLK3 train of pulses 


-are ECL and are used to load the FIFO data into the shift 


register. This offset enables the FIFO to fill itself up and 
allow VDRAM to FIFO data write transfers to operate asyn- 
chronously from data read transfers to the shift register. 


The DP8514’s oscillator outputs, X OUT and DATA OUT, 
which are routed out to the backplane are received and 
resynchronized on all the boards by a D flip flop’s data and 
clock inputs residing within either a DP8513 or DP8514. 
This method of sending the reference clock around the sys- 
tem ensures that the receiving device can easily regenerate 
a synchronous output with a controlled duty cycle irrespec- 
tive of the distortion introduced by the backplane drivers 
and receivers. This becomes increasingly important as MOS 
processors are driven closer to their rated operating fre- 
quency. The DP8514’s architecture additionally minimizes 
system clock skews by virtue of its synchronizer/MOS clock 


driver functional block being identical to the DP8513’s. 


This configuration results in the MOS clock transitions coin- 
ciding with the pixel and load clock phase transitions gener- 
ated by the DP8513’s PLL functional block. 


Utilization of a stable low cost crystal resonator as a refer- 
ence source and matched internal delay paths enable the 
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critical graphics clocks generated by these devices to exhib- 
it good temperature and supply voltage insensitivity. The 
combination of these two products provide system timing 
solutions which are difficult to obtain with off the shelf MSI 
components. 


SCLK 
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FIGURE 3. System Configuration and LCLK Waveforms Using a VSR Containing FIFO 
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DP8512/13/14 
Video Clock Generator 
Evaluation Board 


GENERAL DESCRIPTION 


This Evaluation Board was designed to demonstrate some 
key features of the Video Clock Generator family of integrat- 
ed circuits. Three chip types will be discussed in this note, 
those being: DP8512, DP8513 and the DP8514 (DP8530 will 
not be discussed). 


The DP8512 is a clock generator designed to drive high 
speed ECL shift registers and DAC’s in a graphics system. It 
generates an ECL pixel clock (PLCK), load clock (LCLK) 
and MOS microprocessor clocks. The DP8512 includes an 
on-chip crystal oscillator circuit (XTLB, XTLC). 


The DP8513 is identical to the DP8512 with the exception 
that the crystal oscillator inputs are replaced with TTL inputs 
designed to accept a pair of external reference signals. 


The DP8514 is a crystal clock generator designed to gener- 
ate a 2 phase non-overlapping MOS Clock derived from an 
on-board crystal oscillator circuit. It also includes a synchro- 
nizer circuit used to eliminate clock duty cycle distortion in 
multi-board applications. The DP8514 interfaces directly 
with the DP8513. Three configuration schemes for these 
clock chips are shown in Figures 7 through 3. 


Figure 7 shows the selection of clock components for a 
single board design. The appropriate clock product is a 
DP8512 for a single board application. All the necessary 
clocks are generated on-chip in conjunction with an external 
crystal. All the outputs from the DP8512 are terminated 
within the domain of the single board. 


Figure 2 shows the most common multiple board configura- 
tion. The appropriate clock products for.a multi-board de- 
sign include a DP8513 and multiple DP8514’s. Alignment of 
clock edges from board to board is optimized by utilization 
of multiple DP8514 clock devices. This device helps to elim- 
inate performance degradation due to delays and duty cycle 
variations down the backplane. The DP8513 is designed to 
accept the backplane reference signal generated by the 
DP8514. All the high speed ECL signals related to the 
DAC’s and Video Shift Registers are confined to the mother 
board. No ECL signals need to be shipped across the back- 
plane. Each of the daughter boards contains a DP8514 
which drives its local BITBLT units and frame buffer memo- 
ries. The portion of the DP8514 which is used on the daugh- 
ter board is a re-synchronizer/MOS clock driver for driving 
the local BITBLT and frame buffer memories. The 
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DP8514 receives 1f and 2f reference clock signals from 
the backplane originating from a DP8514 on the mother 
board. The DP8514s on the daughter boards receive these 
two signals and output a 1/, 2 phase non-overlapping MOS 
clock. This clock is now duty cycle corrected by the 
DP8514’s resynchronizer circuit. 


Figure 3 shows a configuration which enables a user to syn- 


chronize very high speed ECL signals in a multi-board appli- | 


cation. The ECL signals are all phase aligned to a common 
low frequency clock derived from the backplane. This de- 
sign eliminates the need for an ECL controlled impedance 
backplane bus. 


The Evaluation board demonstrates attributes of each of 
the above three figures. The board is capable of exercising 
the DP8512/13 to its maximum pixel clock rate of 225 MHz. 
it contains one socket which accepts a DP8512 or DP8513 
device (U1) and two DP8514 sockets. One DP8514 (U2) is 
configured as a crystal oscillator without using the MOS 
clock driver section. The other DP8514 (U3) is configured as 
a MOS clock driver. Jumpers on the board (J1 thru J4) allow 
flexibility of reference clock signal routing. 
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FIGURE 1. Clock Chip DP8512 In Single Board 
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FIGURE 2. Multiple Board Configuration 
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FIGURE 3. Special Multi-Board Configuration Using DP8513 Devices 


CIRCUIT DESCRIPTION 


The DP8512/13 devices are digital PLLs containing two 
phase comparators whose outputs are brought out at 
OPAMP1 and OPAMP2 respectively (see Figure 4). Only 
OPAMP1 is used in this Evaluation Board. The refer- 
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ence input to the phase comparator is SOUT, whose origin 
is either a crystal for the DP8512, or an external source for 
the DP8513. 
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FIGURE 4. Schematic of DP8512/ 13/14 Evaluation Board 
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The other input to the phase comparator is LCLKO, which is 
the divided down VCO signal output of the L Counter. The 
LCKLO signal is phase and frequency compared to SOUT. 
Any phase error between these two signals results in a cor- 
rection of the voltage into the external VCO control element, 
varactor VR1, via the OPAMP1 output. This voltage is pro- 
portional to the amount of phase error and tends to drive 
the frequency of the VCO in the direction which, when divid- 
ed down, minimizes the LCLKO to SOUT reference signal 
phase difference. When the phase transition of LCKLO oc- 
curs before that of the reference input (SOUT), the VCO 
frequency is sensed as being too fast and produces a nega- 
tive going correction voltage to VR1. This in turn slows 
down the VCO and delays the following occurrences of the 
LCLKO’s phase transition. The device’s PLL architecture 
makes it possible to generate output clocks with frequency 
tolerances identical to the crystal reference. In addition, 
when the reference input signal is a backplane signal, the 
generated clocks are positive transition phase aligned with- 
in +2 ns to this reference due to the closely matched de- 
lays in the phase comparator. 


Both the Primary and the Secondary phase detectors have 
negative edge sensitive inputs. However, the Primary phase 
detector inputs appear on LCKO and SOUT via inverters. 
This gives the appearance that the Primary loop phase de- 
tector is positive edge triggered. 


Before beginning to describe the circuit, it must be noted 
that this application board only supports the major functions 
of the DP8512/13. The Secondary Loop function for Gen 
locking and various load clock enable functions is not dem- 
onstrated. 


The DP8514, component U2, functions as a stand alone 
crystal oscillator. The crystal frequency and half crystal fre- 
quency are output on pins XOUT and DATA OUT. These 
two signals are intended to be sent out on to a backplane 
and used as a frequency reference by either clock genera- 
tors or MOS clock drivers located on each of the boards in 
the system. When using the DP8514 as a frequency refer- 
ence for the DP8513, it should be noted that the half crystal 
frequency is the effective input reference frequency. The 
DP8514, component U3, demonstrates the device used as 
a synchronizer and MOS Clock driver. The synchronizer is a 
D register which has a clock input, REFCLK, a data input, 
REFIN and a mode control input, SEL, which allows the 
REFCLK input to control the synchronizer’s output. This fea- 
ture allows either the clock or the resynchronized + 2 clock 
to be fed to the MOS clock driver. The DP8514’s SYNC 
output is a TTL equivalent output in phase with the PHI1 
MOS clock output. The SEL input is grounded on the board 
to demonstrate the resynchronized clock input mode. 

The DP8512/13, component U1, is the Graphics Clock 
Generator. It utilizes a high frequency VCO (Voltage Con- 
trolled Oscillator) phase locked to a crystal reference fre- 
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quency. L1 is the major element in the VCO. Varactor VR1 
and capacitor C14 complete the resonant circuit. The induc- 
tance value of L1 is chosen by the user to operate the VCO 
at the desired pixel clock frequency. When using a DP8512 
in the U1 position, U2’s reference frequency inputs should 
be disconnected by pulling out the jumpers J1, J2 and plac- 
ing the crystal and capacitors C15 and C16 into their posi- 
tions. Note that the DP8512 crystal frequency is not divided 
by two as in the case of the DP8513/DP8514 combination. 


Varactor VR1 is controlled by OPAMP1, a phase compara- 
tor output. The voltage on this pin has a compliance of 
about 4V. This is sufficient to vary the capacitance of VR1 
from 25 pF (4V) to about 40 pF (1.0V). C13 is a DC blocking 
capacitor. It should not be increased in value much above 
470 pF, otherwise it may affect the loop response since R2 
and C13 form a low-pass filter. C17, C18 and R1 form the 
primary loop filter. The value of these components are dis- 
cussed in the Loop Filter Calculations section. 


Jumpers (J6) are provided on the board to program the S 
and L counters. These jumpers are labeled LO, L1, L2, L3 
and SO, $1 and S2. These jumpers program U1 to divide the 
reference frequency in accordance with Tables | and Il. The 
S counter divides the crystal reference frequency down to a 
load clock rate compatible with VRAM cycle times. The L 
counter provides the appropriate division rate for the Video 
RAM parallel to serial conversion operation. Note that in- 
puts LO thru L3 are bypassed to Analog +5V by C9 thru 
C12. This is very important for correct device operation. 
They should be very close to the device. In some applica- 
tions noise on supply or VCC lines can cause problems. 
Capacitors C9 thru C12 eliminate the problem. R3 thru R6 
are provided to terminate the ECL pixel and load clocks. J5 
allows the termination resistors to be returned to VCC — 2V 
or ground. 
TABLE I. S Counter Division Codes 


| on/p Sperber 
| sz | si | so | 


1 


eet 


; 
- 


- 
- 


L = TTL Level Low Input 
H = TTL Level High Input 
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TABLE Ii. Counter Division Codes 


O = Vee or OPEN 
1 = VecLo 


LCLKO, SOUT, PHI1A and PHI2A are available for on-board 
monitoring. During operation, LCLKO is phase locked to 
SOUT. 


USING THE DP8512/13/14 EVALUATION BOARD 


There are three key items to adjust before using the evalua- 
tion board. The first adjustment involves setting the VCO to 
operate at the desired pixel clock frequency. The values of 
the tank components, L1, C14, VR1, determine the VCO 
oscillation frequency. | 


The second item involves setting the desired Video Shift 
Register word width. The width of the video word deter- 
mines the modulus of the L counter. For example if the vid- 
eo word is 16 pixels long, then the L counter is set to divide 
by 16. The modulus of the L counter is settable in incre- 
ments of 4 bits up to 64. Setting is done on the board by 
moving the jumpers on J6 (LO, L1, L2 and L3) to either the 
Logic “HI” or Logic “LO” position. 

The third adjustment involves setting the S counter modulus 
to allow the graphics processor to operate at the highest 
possible rate which is some multiple of the load clock fre- 
quency. The PLL uses the LOAD CLOCK and SOUT fre- 
quencies internally as phase detector inputs for the main 
loop. Phase locked operation occurs when the frequency of 
these inputs are identical. As mentioned previously, the S 
counter can be programmed to divide the crystal reference 
frequency by any number from 1 to 8. For example if we 
select a 20 MHz crystal for the DP8512, a graphics proces- 
sor can derive its system clock from the PHI1 and PHI2 
outputs at 20 MHz. If the S counter is set to divide by 2, (SO 
= High, Si = Low, S2 = Low) then the actual frequency 
available at SOUT and simultaneously applied to the PLL, 
will be 10 MHz. 


Using the above example, let us choose a VCO frequency 
of 80 MHz and set the L counter accordingly. 


fvco = 80 MHz 

fsout = 10.0 MHz 
Therefore L counter modulus is 80 + 10 = 8. Once this has 
been done the board is ready to use. 


INITIAL BOARD ADJUSTMENT PROCEDURE | 


Note: Although this procedure is specific to the 
DP8512/13/14 Evaluation Board, it can be used as a gener- 
al procedure for setting up or trouble shooting most boards 
using the Video Clock Generator PLL circuits. 


1. Apply +5V Power to ANALOG +5V and DIGITAL + 5V. 
For single supply operation, use the traces on the bottom 
of the board to short the Analog + 5V and Digital + 5V. 
Analog and Digital Grounds also have traces which can 
be shorted together. 


2. Apply +3V to the ECL Termination resistors (Load Pow- 
er). Ensure that jumper J5 is shorting the middle pin to the 
topmost pin. This connects +3V to the common point of 
R3 thru R6, providing ECL loading for the pixel and load 
clocks. | 


3. Connect a voltmeter to pin 15, the OPAMP1 output of the 
DP8512/13. This is the VCO control voltage applied to 
the varactor VR1. A test point is available on the board 
right above R1 for this purpose. While monitoring the 
VCO voltage, adjust L1 until the voltage is about 2.5 Voc. 


4. a. If using the DP8513, ensure that J1, J2, C19, C20 and 
XTL2 are in place, while C15 and C16 are absent. 


b. If using the DP8512, remove J1 and J2 jumpers and 
ensure that XTL1, C15 and C16 are in place. 


5. Check that SOUT and LCLKO waveforms are phase 
locked. The pixel clock outputs will now also be phase 
locked since LCLKO is directly derived from the VCO. 


IN CASE OF DIFFICULTY 


If the board does not operate correctly, check that the S 
and L counters are programmed correctly. There are sever- 
al things that can be checked to quickly determine the 
cause of a problem. Following is a troubleshooting se- 
quence which in most cases will lead to a successful resolu- 
tion of clock generator locking problems. 


1. With an external power source, apply 2.5 Vdc to pin 15 of 
the DP8512/13. This overrides the OPAMP1 pin thus set- 
ting the varactor VR1 to its final desired operating condi- 
tion. 


. Check that the frequency of SOUT is the crystal frequen- 
cy divided by the modulus of the S counter. 


3. Check the frequency of the LCLKO output. It should be 
close to the SOUT frequency. Adjust coil L1 until the 
LCLKO frequency is close to the SOUT frequency. If there 
is no output or an unstable wave form on LCLKO, indica- 
tions are that the VCO tank circuit is not oscillating. 
Check the orientation of capacitors C13 and C14. Induc- 
tor L1 should be a high Q RF coil. If problems persist, 
ensure that C9 through C12 are in the circuit. 


4. Check the frequency and amplitude of the ECL PCLK out- 
puts. They should be about 0.8V peak to peak, at the 
appropriate pixel rate. If not, check that the ECL outputs 
are properly terminated and that J5 is in the correct posi- 
tion. The pixel rate is SOUT times the L counter modulus. 


if L1 does not adjust to the correct frequency, check that 
C14 is connected and is the correct value (C13 should 
remain at 270 pF). Check that L1 is the correct induc- 
tance for the chosen pixel frequency. 


nN 


on 
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6. When using the DP8513 and DP8514 combination, the 
XOUT output of the DP8514 can overshoot below ground 
due to extraneous lead inductance. This may cause erro- 
neous phase lock. Placing a 1002 series resistor and a 
5 pF capacitor to ground will cure this anomaly (the ca- 
pacitor connects to the REFIN side to ground). 


7. Remove the 2.5 Vde supply from pin 15 and connect in its 
place an oscilloscope. Adjust L1 until the OPAMP1 volt- 
age is approximately 2.5 Vdc. LCLKO should now be 
phase locked to SOUT. If the OPAMP1 voltage on pin 15 
is stuck at 4V, either the coil has too much inductance or 
the L counter is set to a modulus that is too high. Con- 
versely, if the VCO voltage is stuck at around 300 mV, the 
coil does not have enough inductance, or the L counter 
modulus is set too low. If the OPAMP1 voltage oscillates, 
the loop compensation components may be in error. Con- 
sult the Loop Filter Calculation section C18, the capacitor 
in series with R1, should be approximately 10 to 15 times 
the value of C17). 


Shown in Figures 5, 6, and 7, are the typical waveforms. 


LCLK 0 


TL/F/10398-5 
FIGURE 5. SOUT vs LCLKO Waveforms 


SOUT 


LCLK 0 


TL/F/10398-6 
FIGURE 6. SOUT vs LCLKO Close Up 
(Note absence of any phase jitter) 
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FIGURE 7. Typical Output Waveforms 


DP8512 LOOP FILTER CALCULATIONS 


Several constants need to be known in order to determine 
the loop filter components. They are the loop divide ratio N, 
the phase detector gain Kp, the VCO gain Ko, the loop 
bandwidth w> and phase margin ¢. 


The constant Kp is fixed at 80 wA/rad for the DP8512. N is 
simply the L counter modulus for the main loop. For the 
secondary loop, N is the S counter modulus times any exter- 
nal division between the SOUT pin and the RGP HORIZ pin 
(i.e., if S = 1 and there is a division by 100 counter between 
SOUT and RGP HORIZ, N = 1 xX 100). A 60° phase margin 
is recommended, however the equations allow other values 
to be used if desired. 


The oscillator gain constant Ky can be obtained from the 
VCO components table or determined experimentally. This 
is done by driving R2, the 27 kN resistor which normally 
connects the varactor to the OP AMP output, with an exter- 
nal power supply. Set the supply to VEE + 3V and note the 
PCLK frequency. Next, set the supply to VEE + 2V and 
note the frequency again. The difference in these two fre- 
quencies (times 27 to convert to radians) is Ko. For opti- 
mum performance the desired PCLK frequency should be 
somewhere between the two frequencies measured above. 
This may require adjustment of the coil. 


Before choosing a value of wo, one fact should be pointed 
out: R2, the 10 kQ resistor and C13, the 270 pF coupling 
capacitor between coil and the varactor, form a low pass 
filter with a cutoff frequency of about 60 kHz. Thus the loop 
bandwidth must be chosen to be less than this value. We 
recommend (1000 Hz to 30 kHz times 277) for ao. 
Having found all these constants, the following equations 
are used to find the component values: 

C18 = 3.46 Ky Ko/N wo 

C17 = 0.27 Kp Ko/N wo? 
To use a phase margin of other than 60° use the following: 

R1 = (N @0/2 Kp Ko) (cosec } + 1) 
C18 = (2 Kp Ko/N wo?) (tan >) 
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Example: Design a system with the following characteristics: 


e External horizontal synch of 76.8 kHz 

¢ 1024 pixels per line (1280 pixels including retrace) 
e 8-bit wide video data 

¢ 20 MHz processor rate 

@ 1024 X 1024 Monitor resolution 

@ 20 MHz processor clock rate 

* 60 Hz refresh rate | 


Note that this system will sync to an external source so that 
both loops must be used. ; 


The PCLK frequency will be: 


60 Hz (Refresh rate) 

x 1024 (Horiz) 

x 1024 (Vert) 

xX 1.128 (H retrace) 

x 1.128 (V retrace) 

= 80 MHz 
The components in Figure 8 will be used. Note that Kp = 
19 Mrad/V. Because it is an 8-bit wide system the L counter 
modulus must be 8. By choosing w, = 27 times 30 kHz the 
equations give: 

R1 = 1500, C18 = 1.0 pF and C17 

= 0.08 wF (use C17 = 0.1 pF) 


VCO INDUCTANCE VALUES FOR DP8512/13 


FVCO 


ee Ae 
aM VLC 


Ci4 Cvri 


C17 


C13 
270 pF 


Li 


VR1 


C14 


TL/F/10398-8 
FIGURE 8. Circuit of VCO External Components 


The Secondary Loop filter component values will be deter- 
mined in a similar fashion. Note that the value for K, for a 
crystal or ceramic resonator VCO are markedly smaller. In 
choosing po, it should be noted that wo for the secondary 
loop should be smaller than wo for the primary loop, so that 
the main loop will be able to track the secondary without 
losing lock (Table III shows various C values versus pixel 
frequencies). 


TABLE Ill. L and C Values versus Pixel Clock Frequencies 


Frequency 


Suggested Varactor VR1 
ee aii (Motorola) 


Ko 
Mrad/V 


DELEVAN 9405-20 MV209 
40 DELEVAN 9405-12 MV209 14 
60 COILCRAFT 146-10J08 ~ MV209 16 
80 COILCRAFT 146-06J08 MV209 19 
100 COILCRAFT 146-04J08 MV209 21 
120 COILCRAFT 150-03J08 MV209 31 
140 COILCRAFT 150-02J08 MV209 27 
160 COILCRAFT 150-02J08 MV209 27 
180 COILCRAFT 150-02J08 MV2205 26 
COILCRAFT 150-01J08 MV2205 
COILCRAFT 150-01J08 MV2205 


COMPONENT SOURCES: 

DELEVAN— AMERICAN PRECISION INDUSTRIES 
Electronic Components Group 
Delevan Division 
270 Quaker Road, East Aurora, NY 14052-0449 
Phone (716) 652-3600 Telex 91-293 


COILCRAFT— COILCRAFT 
Cary, Illinois 60013 
Phone (312) 639-2361 TWX 910-651-2251 


APPENDIX 


DP8512/13 PCB LAYOUT CONSIDERATIONS 


1. Use separate Vcc and GND branches for the analog 
(VECL 0, 1, Vee) versus digital (VTTL 0, 1 and GND 0, 1) 
supply pins. Use of the ferrite beads between the branch- 
es is also highly recommended. 


2. Bypass supplies right at the device. (Preferably right un- 
der the device using surface mount components.) 


3. Ground unused inputs to reduce noise generation (eg. 
ENIN1, ENIN2). 

4. Bypass LO, L1, L2, L3, to VECLO close to the device. 

5. Mount VCO coil L1, varactor VR1, C13 and C14 right next 
to VCO1, VCO2 pins. Ground for VR1 and C14 should 
return to a single point at Veg pin 7. 

6. Place varactor diode and associated DC blocking capaci- 
tor (270 pF) right next to the VCO coil. 

7. Place capacitor from VCO2 to Veg right next to VCO2 
and VCO coil. 

8. If using multi-layer layout with dedicated Vcc and Ground 
planes, ensure that: 

a. For power, VECL1 and VECLO should have their own 
small isolated ground plane (Analog Vcc). 

b. For ground, the two Veg pins should have their own 
small isolated ground plane (Analog GND). 


ADDITIONAL DP8512 APPLICATIONS 
1. External Drive of VCO input. 


Parasitic capacitance limits the upper frequency of the inter- 
nal Pierce oscillator on the DP8512 to approximately 
225 MHz. In order to operate above this frequency, an ex- 
ternal oscillator must be capacitively coupled into the oscil- 
lator pins. When driven externally, operation to 300 MHz 
can be achieved. 

The circuit in Figure 9 can be used to drive the VCO input of 
the DP8512. Note that the VCO1, the base of the internal 
oscillator transistor, (see VCO INPUTS diagram, page 11 of 
DP8512 data sheet) must be pulled up to VCO2, the collec- 
tor. A suggested value of resistance is 2.2 kQ. The drive 
level into VCO2 should be in the range of — 10 to 0 dBm. 


200 pF 


0 to -10dbm 
Gen.Level 


500 DP8512 


TL/F/10998-9 
FIGURE 9. External Drive of DP8512 VCO 


2. External Drive of VCXO input. 


The DP8512 contains an internal oscillator designed to be 
used as a master clock from which the other frequencies 
are derived. This oscillator is generally used with an external 
crystal. This crystal may stand alone, or be phase locked to 
another source via the Secondary Loop within the DP851 2. 
Locking is accomplished via a varactor diode within the 
crystal oscillator circuit (VCXO). 
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If the desired degree of frequency shifting lies beyond the 
range of the crystal, then other oscillator elements must be 
considered. An alternative to a crystal is a ceramic resona- 
tor. Conventional ceramic resonators are not suitable for 
VCO applications due to their spurious anti-resonant oscilla- 
tion modes. Special low Q ceramic resonators are neces- 
sary in this application. 

If desiring to use an external oscillator, special considera- 
tion must be given to interfacing the VCXO pins, namely 
XTLB and XTLC (pins 21 and 22). These pins are not TTL 
compatible. It is recommended that the XTLC pin be driven, 
while leaving XTLB open. The input signal must be in the 
range of 400 mV to 600 mV peak-to-peak. The circuit in 
Figure 10 is suggested. 


4.5V 


DP8512 


TL/F/10398—10 
FIGURE 10. External Drive of DP8512 
Crystal OSC. Inputs 


Note that R1 and R2 form a voltage divider that reduces a 
TTL signal to an acceptable level. When combined with the 
effective input impedance of the XTLC input, the resulting 
signal level is approximately 500 mV p-p at 10 MHz, the 
frequency at which the above circuit was tried. 


Several precautions are necessary in order to ensure that 
extraneous noise not be injected into this input. If an exter- 
nal reference signal is utilized, precautions should be taken 
to ensure that extraneous noise is not injected into the part 
as a result of dissimilar ground references. The internal os- 
cillator is referenced to VEE and consequently R2 should be 
placed close to the DP8512’s VEE pins. The circuit shown in 
Figure 10 will provide approximately the same effective sig- 
nal into the DP8512 XTLC input whether the device is being 
used in either common or split supply mode of operation. 


3. Loop Jitter Considerations 


The most critical layout area of the board is related to the 
VCO. It is very important that the analog supplies to which 
the VCO components are connected are isolated, as well as 
possible, from any digital noise sources. Digital noise im- 
pressed on the VCO signals has recently been observed to 
induce phase modulation onto the VCO’s output frequency. 
Under certain conditions relating to layout, frequency of op- 
eration, power supply bypassing, and output loading, the 
digital noise may cause the loop to exhibit some amount of 
jitter. This can be observed by viewing LCLKO on an oscillo- 
scope while triggering the scope from SOUT. The result will 
be a blurry trace. Careful attention to proper supply bypass- 
ing and good RF layout techniques may eliminate the jitter. 
If it does not, the external VCO circuit shown in Figure 77 
can be used to further decouple the VCO from the power 
supply and reduce the jitter. 
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TL/F/10398-11 


FIGURE 11. External VCO Circuit 


The PN3653 is a high frequency oscillator transistor. To re- 
duce component count the 1000 pF capacitor and the 
27 kQ resistor may be eliminated by simply returning the 
anode of the varactor to the emitter of the transistor. This 
will result in a control voltage range reduction of about 1V 
which may or may not be acceptable depending on system 
requirements. 


DP8512 PHASE DETECTOR/ CHARGE PUMP LINEARITY 
TEST 


Figure 12 demonstrates the zero dead band characteristics 
of the DP8512/13 phase detector or charge pump circuitry. 
Note the linearity at the zero phase offset crossover point. 
When the device is locked, the phase jitter is imperceptible. 


AVERAGE CHARGE PUMP CURRENT= pA 


“10 0 10 20 


-20 
RGP HORIZ 
{EXT REF FIRST— FIRST 


PHASE OFFSET = ns 


. TL/F/10398-12 
FIGURE 12. DP8512/13 Phase Offset 
versus Charge Pump Current 
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By changing T as shown in Figure 13 and plotting the cur- 
rent delivered by CPO2, we are able to see that there is no 
deadband associated with the phase detector. This is guar- 
anteed by the phase detector shut off mechanism which 
requires both the pump up and pump down currents in the 
charge pump become active before the phase detector will 
turn off the charge pump. 


The pulse repetition period used in the test was 250 ns, 
pulse width was 50 ns LOW (phase detector 2 triggers on 
the falling edges) and rise and fall times were 3 ns. 

Phase detector 1 is more difficult to test because of the 
counters in the loop, however the circuitry is identical to that 
of Ph. Det. 2, therefore it also exhibits zero deadband. 


RGP HORIZ 
DP8512 


HP Dual Channel 
Pulse Generator 


4 EXT REF 


CP02 OPAMP2 


Current 
Meter 


ds TL/F/10398-13 
FIGURE 13. DP8512/13 Phase Detect/Charge 
Pump Linearity Test Setup 


> 
DP 8512/13/14 EVALUATION BOARD CIRCUIT LAYOUT Shown also are the PCC Socket diagrams viewed from the = 
Following are illustrations of the DP8512/13/14 Evaluation bottom side (see Figures 17 and 78). A parts list is included | & 
Board printed circuit board (see Figures 14, 15 and 76). in Table IV. > 


TL/F/10398-14 


TL/F/10398-15 
FIGURE 15. Component Side, DP8512/ 13/14, Evaluation Board 
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TL/F/10398-16 
FIGURE 16. Silk Screen, Component Side, DP8512/ 13/14, Evaluation Board 
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TL/F/10368-17 
FIGURE 17. Bottom View, Pinout DP8514 


Ox 


TL/F/10398-18 
FIGURE 18. Bottom View, Pinout DP8512/13 
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DES 


L1 

Ui 
U2 
U3 


XTL1 
XTL2 


R1 
R2 
R3 
R4 
R5 
R6 
R7 
R9 


C1 
C2 
C3 
C4 
C5 
C6 
C7 
C8 
C9 
C10 
Ci1 
C12 
C13 
C14 
C15 
C16 
C17 
C18 
C19 
C20 
C21 
C22 
C23 
C24 
C25 
C26 
C27 


See Table 

DP8512 or DP8513 
DP8514 

DP8514 


40 MHz Crystal 
40 MHz Crystal 


2702 1/4W 5% Res. 
27 kN 1/4W 5% Res. 
5120. 1/4W 5% Res. 
5120 1/4W 5% Res. 
5129 1/4W 5% Res. 
519. 1/4W 5% Res. 
4.7k 1/4W 5% Res. 
5.1k Chip Res. 


10 pF Electrolytic 
0.1 F Ceramic Cap. 
0.01 uF Chip Cap. 
100 pF Chip Cap. 
1000 pF Chip Cap. 
0.01 F Chip Cap. 
0.1 wF Chip Cap. 

10 wF Electrolytic 
0.01 uF Chip Cap. 
0.01 y4F Chip Cap. 
0.01 F Chip Cap. 
0.01 uF Chip Cap. 
470 pF Chip Cap. 

30 pF Chip Cap. NPO 
30 pF MICA Cap. 

30 pF MICA Cap. 
0.01 F Ceramic Cap. 
1.0 wF Ceramic Cap. 
30 pF Ceramic Cap. 
30 pF MICA Cap. 

0.1 uF Chip Cap. 
0.01 F Chip Cap. 
0.01 wF Chip Cap. 
0.1 wF Chip Cap. 

0.1 «F Chip Cap. 

0.1 wF Chip Cap. 
0.01 uF Chip Cap. 


TABLE IV. Parts List 


Description | PartNo. 


See Table Il 
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Suggested Vendor 


See Table III 

National Semiconductor 
National Semiconductor 
National Semiconductor 


NEL Frequency Control Inc. 


357 Belliot St. 
Burlington, WIS. 53105 
(414) 763-3591 
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A Graphics Acceleration 
Card for the AT Using the 
Advanced Graphics Chip 
Set 


1.0 OVERVIEW 


Members of the Advanced Graphics Chip Set (AGCs) are 
used in the design of an AT graphics card. The purpose of 
the design is to use a high-performance graphics microproc- 
essor and coprocessors on the board to improve the AT’s 
performance for graphics applications. 


2.0 ARCHITECTURE 


A graphics processor can be added to the AT’s architecture 
to improve graphics performance. It is given the task of ma- 
nipulating the pixels, freeing the main CPU to spend more 
processing time on other aspects of applications. In an AT 
using one of the standard graphics cards, the graphics 
frame buffer appears as an array in the CPU’s memory map. 
To render graphics primitives, the main CPU must calculate 
the algorithms for the primitives and write the pixels into the 
frame buffer, accessing them one at a time. Hardware on 
the card generates the video timing and collects the data 
from the array to generate video signals for the monitor. 


Figure 7 is the system architecture using the 4-plane board. 
The DP8500 Raster Graphics Processor (RGP) and DP8511 
BITBLT Processing Units (BPUs) are used to render text 
and other objects in the frame buffer and generate the video 
signals. The RGP is optimized for rendering the graphics 
primitives; the BPUs allow parallel execution of the algo- 
rithms in each bit plane. In a 4-bit per pixel design such as 
this one, images can be drawn at rates of up to 64 bits (16 
pixels) per memory access cycle. The number of pixels per 
cycle is constant for a RGP/BPU system independent of the 
number of bit planes. For example, a system with 32 bits per 
pixel renders at up to 512 bits per memory cycle, a rate that 
is difficult to attain with a general-purpose CPU architecture. 


The 4-plane board’s memory is dual-ported to allow the 
RGP and main CPU to use it as a communication area. 
Upon power-up, the main processor resets the board and 
loads the RGP code into its memory. The RGP executes its 
ROM monitor after a reset has occurred; the main CPU tells 
it to branch to the start address of the downloaded code. 
The ROM monitor is also used with the debugger, which 
runs on the AT. In a typical application, the host CPU gives 
display lists to the RGP by writing them into its memory. 
After deposit of the display list the main CPU can resume 


execution of the application, while the RGP renders the ob 


jects in the screen buffer. 


Note that the host still has direct access to the screen buff- 
er, as in the standard architecture, allowing use with appli- 
cations that do not take advantage of the graphics proces- 
sor. Images can also be downloaded into the screen buffer 
directly from disk. 


3.0 FUNCTIONAL BLOCKS 


The following sections explain the design of each of the 
functional blocks in the architecture. Figures 14a through 
74m are the schematics. Figures 15 through 37 are the 
equations for the GALs. Figure 32 is the parts list. 


National Semiconductor 
Application Note 609 
J. Margeson 


3.1 Clock Generator 


The DP8512 Video Clock Generator (U2), with Y1, C3, C4 
and U3, generates the video and bi-phase CPU clocks. The 
DP8512 data sheet explains this circuit in more detail. The 
RGP and BPUs used the 20 MHz bi-phase clock (PHI1 and 
PHI2). The clocks are buffered by US to go to other sections 
of the board. CK1A is used by the bus arbiter/timing genera- 
tor and video plane controller. The parallel interface se- 
quencer is clocked by CK1B. CK1C is used as the 20 MHz 
pixel clock. In other designs, the pixel clock could be much 
higher; it is not limited by the RGP. CK2 clocks the regis- 
tered PROMs and is used by U6 to generate the VDRAM 
row/column select, RSEL. The SOUT output is divided 
down to 1.25 MHz for use as LCK. 


3.2 RGP 


The DP8500 Raster Graphics Processor (RGP) executes 
graphics rendering tasks given to it by the main CPU and 
generates the video timing. The RGP asserts ALE to re- 
quest a memory cycle; BS1, BSO, R and W indicate the type 
of cycle. It fetches instructions, accesses operands, uses 
the BPUs to execute drawing cycles, and loads the shift 
registers in the VDRAMs (video dynamic random access 
memories) with video data for display refresh. Execution of 
these memory cycles is discussed in the Bus Arbiter/Timing 
Generator section. 


The lower 16 address lines are latched by U7 and U8 on 
ALE. U11 buffers the upper 8 address lines. U9 and U10 
buffer the data lines. These five |.C.’s are in TRI-STATE® 
when the AT is accessing the RGP’s memory. 


Video timing is generated from LCKL. Rising edges of LCKL 
are counted inside the RGP to generate DRREQ (display 
refresh request), VSYNC, HSYNC and BLANK, according to 
the software programmable video parameters. LCK, in this 
design, is 1.25 MHz (800 ns), representing 16 pixels; the 
video timing can be adjusted in software to this resolution. 


The HOLD signal is tied to Voc by jumper W1. This jumper 
should be in place, unless hardware modifications are made 
to the board. 


The details of the RGP operation are discussed in the 
DP8500 data sheet. 


3.3 Parallel Host Interface 


The AT can read and write devices in the 4-plane board’s 
memory map. To execute one of these memory cycles, it 
asserts the proper address and strobe. The AT interface 
state machine handles the bus protocol; it holds the AT and 
requests to the memory arbiter for a remote access memory 
cycle. Data is transferred when the arbiter grants the cycle, 
then the AT is released. The AT bus does not have 16M 
words of space to map the board into, so segment registers 
are required to hold the upper portion of the on-board ad- 
dress. There are also control registers for reset, interrupts 
and status. 
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FIGURE 1. System Architecture 
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U220 generates BDSELL (Board Select Latched, active 
low) and M16 (Memory Select 16-bit, active low) from AEN 
and the upper eight address lines of the AT bus. BDSELL is 
active when AEN is low and the proper address is asserted 
and is latched by BALE. The GAL is used as a transparent 
latch in this case by using BALE as an enable and feeding 
back BDSELL. The board resides in a 64k byte area in the 
AT memory map, the beginning of which is set by jumpers 
W2, W3 and W4. The base address can be set to OA0000 
(W2), OD0000 (W3), A00000 (W4) or any combination (there 


can be multiple images of the board if desired). It is recom- 


mended that the board not be used at base address 
OA0000 or ODOO00N. The board can only accept 16-bit ac- 
cesses; byte accesses result in errors. AT address line 0 
must be “0” for word transfers. If an alternate base address 
is desired then U220 can be reprogrammed. 


The AT’s processor, in the mode used by DOS, can only 
access the first megabyte of the 16 megabytes addressable 
by the bus. The 4-plane card can reside above OFFFFFH if 
one of three schemes is used to access it: the AT’s proces- 
sor is switched into protected mode (allowing it to assert 24 
address lines instead of 20), DMA transfer is used, or a co- 
processor card is used that is capable of bus mastership. In 
these cases, the board can be set to reside at AOOOOOH— 
A1FFFFH (a double image). 


The interface is designed to execute 16-bit transfers to max- 
imize the data transfer rate. The M16 signal is generated 
combinationally from the upper seven address lines. This 
signal tells the AT that a 16-bit transfer can be executed. If it 
is asserted too late, the access is split into two eight-bit 
accesses. Address line 16 can not be used to generate M16 
because it becomes valid much later than the upper seven 
address lines. In cases where the board is set to reside in 
the lowest megabyte of the AT bus space, address line 16 
must be ignored, requiring that the neighboring 64k byte 
area have a sixteen bit device mapped into it or no device at 
all. If the board resides in the upper 15 megabytes, then a 
128k byte space is used (a double image in memory), and 
address line 16 is ignored. 


When accessing the 4-plane board, 24 address lines must 
be asserted. The AT can not generate all of them during a 
read or write cycle, so the upper ten lines are held in a 
segment register. There are two segment registers on the 
board: one moveable pointer and one fixed pointer. The 


moveable segment pointer is set before the access is made. — 
AT data line 0 corresponds to RGP address line 14, through — 


AT data line 9 which corresponds to RGP address line 23. 
At the time of the access, the upper ten lines are driven by 
the register (U19); the lower 14 lines come from the AT bus, 
a segment size of 16k words. This register is also readable 
by the AT (U20). The board does not use RGP address lines 
16 and 19 which correspond to AT data lines 2 and 5. When 
the register is read, data lines 2, 5, and 10 through 15 are 
undefined and must be masked if the address is to be used 
in any calculations. However, the register can be restored 
exactly as it was read because the board ignores these bits. 
For multiple accesses within the same segment, the seg- 
ment pointer only needs to be set once. The offset into the 
area is determined by the lower AT address lines. AT ad- 
dress line 1 corresponds to RGP address line 0, on up to AT 
address line 14 which corresponds to RGP address line 13. 
This scheme allows the AT to write to any memory location 
on the board. 
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The fixed segment register points to a 8k word contiguous 
area mapped onto the end of memory plane 3, RGP ad- 
dresses 46E000-46FFFF. Using the fixed segment area 
eliminates the overhead of setting the pointer and allows 
multiple software tasks access the board without needing to 
save and restore the pointer. The monitor and debugger use 
this as a communication area. The fixed segment pointer 
drives the upper eleven address lines; the lower 13 lines 
come from AT address lines 1 through 13. Address lines 19 
and 16 are not used on the board, thus are not driven. 
The control and status registers are explained in the AT bus 
memory map. The AT can reset the board, assert its non- 
maskable interrupt and maskable interrupt and examine its 
status. The RGP can deassert the maskable interrupt. 

The RGP can interrupt the AT by writing to a register in its 
memory map. The AT can deassert the interrupt. W10-—W21 
select the interrupt to be used. 


AT Bus Memory Map: 


NNOOOO is the base address as selected by W2, W3 and 
W4. 


NNOOOO-NN7FFE: Moveable Data Area. The AT address 
can be derived in a C program by: 
AT__addr := ((RGP_addr < < 1) & 
Ox7ffe) + OxOONNOOOO; 
Read and Write. 

NN8000—NNBFFE: Fixed Data Area. The AT address can 
be calculated by: 
AT__addr := ((RGP__addr < < 1) & 
OxSffe) + OxOONN8000; 
Read and Write. 


NNCOOO: Segment Pointer. The segment pointer 
can be calculated by: 
segment_pointer := (rgp_addr > > 
14) & OxO3FF; 

Read and Write. 
NNC800: — NMI Register. Any write to this location 
i temporarily asserts the RGP’s NMI line, 
which is edge sensitive. Write only. 

NNDOOO: AT Interrupt Reset. Any write to this lo- 
cation deasserts the interrupt from the 
4-plane board to the AT. It can be used 
to clear the interrupt when it is serviced. 
Write only. 

NND800: Board Status. The AT can read the 

| present state of four signals on the 

board: 
D3: RGP Reset (RSTI) 
D2: RGP Halt (HALT) 
D1: RGP Interrupt (INT) 
DO: AT Interrupt (ATINT) 
Read Only. 

NNEOOO: RGP Reset (RSTI). The RGP can be re- 


set by writing a “1” into this register, 
followed by a ‘‘0”’. It must be set to 1” 
for a minimum of 800 ns. Reset initializ- 
es the board, causing the RGP to begin 
execution of the ROM monitor. The de- 
fault screen will be cleared and the re- 
maining VDRAM will be undefined. 
Write only. 


NNE800: RGP Interrupt (INT). The RGP’s level 
sensitive maskable interrupt can be as- 
serted by writing a “1” into this register. 
Writing a “0” removes the interrupt re- 
quest. The RGP also has the ability to 


reset the request. Write only. 


The AT interface sequencer is implemented in U221. Ac- 
cesses to the control registers are transparent to the inter- 
face sequencer; only data transfers requiring a 4-plane 
memory cycle are handled by the sequencer. A flowchart of 
the sequencer is shown in Figure 2. The sequencer remains 
inactive until the host attempts a data transfer to/from the 
board’s memory. In the beginning of the access, the ad- 
dress lines on the AT bus become valid, causing U220 to 
assert the M16 signal. U220 latches BDSELL on BALE. If 


The read or write strobe follows, causing U223 to asynchro- 
nously return REMRDY low (an open collector line). Figure 
3 shows the timing of the access. While the AT is being 
held, the strobe is synchronized to CK1B so that it can be 
input to the interface sequencer. The strobes are effectively 
latched twice for metastability: once before the sequencer, 
and sequencer transitions based on the strobes change 
only a single bit of the state variable. The first transition from 
the idle state sends a remote access cycle request to the 
arbiter (REMREQ). The arbiter/timing generator state ma- 
chine executes a remote access cycle when it is the highest 
priority cycle requested. It is designed such that the AT is 
held much less than the maximum 2.1 js in the worst case. 
The time period for the data transfer is delimited by the 


REMACK signal. In the case of a write, the data is driven 
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the card is being accessed with a DMA cycle or by a coproc- onto the 4-plane’s data bus by U23 and U24. For a read, 


essor card, BALE remains high and the upper address lines 
remain valid throughout the entire cycle. 


REMACKD = 0 


REMREQ = 0 Idle 
RDYHI = 0 


/BDSELL~& (REMRL + REMWL) 


REMACKD = 0 
REMREQ = 1 
RDYHI = 0 


Request remote access cycle 


/REMACK~ 


REMACKD = 1 
REMREQ = 0 
RDYHI = 0 


Execute Access 
(Always five states) 


/REMACK~ 


REMACKD = 0 
REMREQ = 0 
RDYHI = 1 


Bring AT ready line high until 
read/write strobe is removed 


/BOSELL~& (REMRL * REMWL) 


TL/F/10429-2 
FIGURE 2. AT Interface State Machine Flowchart 
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TL/F/10429-3 
FIGURE 3. Host CPU Access Timing 


the data is latched into U23 and U24 on the rising edge of 
REMACK. After the transfer is made on the 4-plane board, 
RDYHI is asserted to inhibit the REMRDY signal. The data 
is driven onto the AT bus until the read strobe is removed. 
The arbiter/timing generator can continue executing other 
types of memory cycles once the transfer is completed, be- 
fore the AT removes its read or write strobe. The interface 
sequencer returns to the inactive state after the host brings 
its read or write strobe high. 


Accesses to the data area by the AT are longer than most 
AT bus cycles, so their frequency should be minimized. 
These accesses also consume 4-plane board memory 
bandwidth, but will usually represent a small portion of it. 
They should not significantly reduce the RGP’s rendering 
speed in most applications, because the number of memory 
cycles required to render objects is typically much greater 
than the number of accesses needed to transfer display 
lists. Control register accesses (addresses NNCOOO- 
NNFFFE) do not add wait states to the AT bus cycle. 


3.4 VDRAM Refresh Generator 


VDRAM refresh requests are generated by a counter in 
U208. LCKL is divided by 20 to request cycles at 16 ys 
intervals. U6 holds the request until it is reset during refresh 
cycle by the timing generator. The request is synchronized 
in U18 to make setup time for U200. 


3.5 Memory Arbiter/Timing Generator 


Several actions requiring memory cycles have been dis- 
cussed. Requests for these memory cycles come from vari- 
Ous sources; this circuit services the requests on a priority 
basis and generates the timing for them. 


The arbitration and timing generation is done by a sequenc- 
er implemented in U200. Figure 4 is the flowchart of the 
sequencer. Six types of memory cycles are executed, listed 
from highest to lowest priority: 


A Display Refresh 

B DRAM Refresh 

C __ Host Processor Access 

D _Instruction/Operand Read or Write 
E1 Drawing Read or Write 

E2 Drawing Read/Modify/Write 


Highest priority is given to branch A, the display refresh cy- 
Cle; it is the most latency-sensitive of the six cycles. This 
cycle is used to load the shift registers inside of the 
VDRAMs for video signal generation. The RGP generates 
the request for this cycle and the address for the load. 


The RGP internally arbitrates between display refresh and 
its processing and drawing cycles. A display refresh cycle 
request can be queued behind another RGP cycle that is in 
progress. The worst-case delay for entering the display re- 
fresh cycle occurs when the RGP requests a drawing 
read/modify/write cycle just before it internally generates 
the display refresh request, preceded by a host processor 
access request, followed by a VDRAM refresh and a second 
host processor access request. The AT access is serviced 
first because the request arrives before the VDRAM refresh. 
The VDRAM refresh is serviced second, followed by the 
second AT access, the drawing cycle and then the display 
refresh. Two VDRAM refresh cycles will not occur in close 
proximity, which is also true of the display refresh requests. 
Once the cycle is entered, the sequencer can be held by the 
MIDSCAN signal to insure proper timing of the shift register 
load. If the load is done during video blanking then the cycle 
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is not held; the shift registers are not clocked during this 
period and timing of the load is not critical. If the load occurs 
in the middle of a scan line, then the sequencer is held until 
after the last shift of the previously loaded data. 

The VDRAM refresh cycle, branch B, is second in priority. 
The refresh requests are generated every 320 clock cycles. 
The request must be serviced and cleared before another is 
generated. The worst-case latency occurs when this re- 
quest is made just after the RGP has started a drawing 
read/modify/write and it generates a display refresh re- 
quest following the drawing cycle. This guarantees that 


RSTRRQ is asserted and deasserted before the next re- 


quest is generated. 

The remote access cycle, C, is third in priority. The AT must 
be held until the remote access is completed, but the AT 
bus cycle must not be stretched longer than 2.1 ws (42 
clock cycles). If the bus is held too long then the AT’s DMA 
controller may miss refresh cycles for its DRAMs. The worst 
case occurs when the AT initiates a remote access just after 
a read/modify/write cycle has been started, followed by a 
VDRAM refresh request and a display refresh request. The 
remote access request will not be serviced until these three 
cycles are completed; the read/modify/write cycle can not 
be interrupted after it is started and the other two cycles are 
higher in priority. 

Branches, D, E1 and E2 execute instruction/operand mem- 
ory accesses, drawing read/writes, and drawing 
read/modify/writes respectively. These three cycles are 
used by the RGP for processing and drawing and are effec- 
tively equal in priority as it will never request more than one 
of them at once. They are lowest in priority because they 
have no particular maximum latency; delaying them only 
slows RGP code execution. 

The VDRAM access time is reduced when consecutive 
drawing accesses have the same row address The drawing 
algorithms tend to operate on memory locations in close 
groups. The memory page size is 256 words, which is equiv- 
alent to 4k pixels or 6.4 scan lines. The RGP generates a 
page break signal to indicate if a row address boundary has 
been crossed. At the end of a drawing cycle, the sequencer 
can loop back into another drawing cycle of either type, 
keeping RAS low and reducing the cycle time by 100 ns. 
This will occur if: a drawing cycle is requested, it is in the 
same page and there are no requests for higher priority cy- 
cles. A display refresh cycle request will cause the RGP to 
break up the page of drawing accesses, because the arbi- 
tration in this case is done inside the RGP. Request of a 
VDRAM refresh or host access causes the arbiter to force a 
page break. 

The AT interface sequencer requests a host access cycle 
by bringing REMREQ low. REMACK is returned to indicate 
when the transfer takes place. While REMACK = low, the 
buffers driving the address and data lines from the RGP are 
disabled so that the AT interface circuitry can drive them. 
REMACK also controls the multiplexing between the RGP’s 
and AT’s read and write lines. 

The REMREQ (Remote Request, active low) signal so that 
the host bus interface sequencer can indicate to the bus 
state machine that a remote access cycle is needed. 
REMACK (Remote Acknowledge, active low) indicates 
when the bus state machine is executing a remote access 
cycle. The host bus interface sequencer uses this acknowl- 
edge signal to determine when the host data is to be trans- 
ferred to/from RGP memory. 
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The sequencer equations in Figure 5 were compiled from 
the sequencer flowchart in Figure 4. Figure 6 shows the 
equations after minimization. The states were renumbered 
so that the equations minimized into sets of product terms 
that fit into the GAL20V8. Each output has eight available 
product terms. ST1 requires ten product terms, so the result 
of three of them was fed back from another output. The 
setup time is critical on some of the inputs, so product terms 
dependent only on present state and RESET were fed back. 


Figure 7 shows how the memory control signals are gener- 
ated from the state variables; the minimized equations are 
shown in Figure 8. This logic is implemented in U201 and 
U202. 


3.6 Address Decoder/ Memory Control 


The frame buffer is organized as four memory planes, as 
shown in Figure 9. Each pixel is represented by one bit in 
each plane. Each word in the array contains one bit of six- 
teen pixels and is mapped to a unique address in the global 
address space. Multiple memory planes can be accessed at 
once during drawing cycles, so a separate drawing address 
space is used. This space ranges from 000000 to OOFFFF; 
each location corresponds to four words, one in each plane. 
In cases where one pixel is accessed such as line drawing, 
B(3:0) specify which of the sixteen pixels is selected. When 
INTPL = 0, the drawing source read comes from within 
each plane, as shown in Figure 10. Figure 17 shows the 
data path for a source read when the interplane bit is set. In 
this case, data is read from one plane into the four BPUs 
through the global data bus. The interplane BITBLT allows 
images to be copied from one plane to another and is con- 
venient for rendering characters in all planes from fonts 
stored in a single plane. The data path for a destination 
write is shown in Figure 12. 


Some of the memory control signals a are held in U16. This 
register is set before a drawing operation is executed. The 
write mask, WMASK (3:0), is used to enable/disable planes 
for drawing operations. The interplane bit, INTPL, is used to 
select the BITBLT data source. Three of the bits in the reg- 
ister are used as flags. The flags are connected to J7; 
FLAGO drives an LED. 


The BPU control register enable, BPU function register se- 
lect enables, and device selects for the write mask, UART 
and PROMs are generated by U204. PDLE and POE are 
used to access the pixel port and are generated by U206. 


The column address strobes, CAS(3:0), are generated by 
U203. The control lines BE(3:0) and DIR(3:0) for the buffers 
between the main data bus and the data busses on each 
plane (U102, U103, U112, U113, U122, U123, U132 and 
U133) are generated by U205 and U206. The row and col- 
umn addresses are multiplexed by U12 and U13. U6 con- 
trols the multiplexer by delaying RAS a half clock cycle. 


The BPUs are run one clock cycle behind the RGP so that 
some of the status signals generated by the RGP are avail- 
able earlier. The BPU control signals are delayed by U4. 
The details of the BPU operation: are discussed in me 
DP8511 data sheet. 


Software written for the DP850EB Evaluation Board can run 


on this board if the video parameters are altered; the old 
memory map is a subset of the new one. 
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RGP Memory Map: 


000000-0007FF: Monitor PROM (2k Words, Word = 


Bits) 

Bitplane 0 (64k Words) © 
Bitplane 1 (64k Words) 
Bitplane 2 (64k Words) | 
Bitplane 3 (64k Words) 


400000-40F FFF: 
420000-42FFFF: 
440000-44FFFF: 
460000-—46F FFF: 


800001: BPU Control Register Enable (CRE) — 

800002: BPU Pixel Data Latch Enable (PDLE) 

800004: BPU Pixel Output Enable. (POE) 

A00001: BPU Function Select Register Enable 
(FSEO Bitplane 0). 

A00002: BPU Function Select nemeter Enable 
(FSE1 Bitplane 1) 

A00004: BPU Function Select Register pnepe 
(FSE2 Bitplane 2) 

A00008: BPU Function Select aueien enable 


(FSES Bitplane 3) 


UART. The UART should not be ac- 
cessed by the AT. - | 


RGP Maskable Interrupt (INT) Reset. 
The RGP can reset its INT line to inac- 
tive, after the host has asserted it, by 
writing any data. Write only. 


Video Plane Write Mask Register. The 

_ bits in the mask register are the same 
as before. An LED is driven by FLAGO; 
it is on when the flag is set to zero. 
Write only. 


AT Interrupt Register. The RGP can in- 
terrupt the AT by writing with DO = “1”. 
The interrupt line asserted is deter- 
mined by the jumper arrangement on 
the board. Writing with DO = “0” resets 
the AT interrupt. The AT can also re- 
move the interrupt. Write only. 


3.7 Video Generation 


The video timing is generated by the RGP. The rising edge 
of LCKL increments the software programmable counters in 
the RGP to generate the video shift register loads, BLANK, 
VSYNC and HSYNC. These signals can be adjusted in soft- 
ware to the resolution of LCKL (800 ns). 


The display refresh memory cycles transfer data into the 
video shift registers inside the VDRAMs. The VDRAM shift 
registers hold 1024 pixels in this system, 256 bits per de- 
vice. The video data is shifted every 800 ns during active 
video, 16 pixels per shift. The clock is derived by masking 
LCKL with BLANKD. Data from the shift register outputs of 
the VDRAMs is multiplexed by using the output enables 
SE(3:0) into the shift registers U100,.U110, U120 and U130. 
These four shift registers can be 25 ns GALs or 74LS195As. 
The data is blanked in U211 during horizontal and vertical 
blanking; the data and synchronization signals are latched 
and buffered by U18. | | 


C00000-C00007: 


Co00000: 


E00007: 


FOOOOO: 


U207 and U5 are used to control the timing of the video shift 
register loads. These memory cycles can occur during 
blanking of the video data or while data is being shifted out 
to the monitor. During blanking, the timing of the load is not 
Critical because the data is not being shifted. In this case, 
the MIDSCAN signal is kept low and the sequencer is not 
held. When data must be loaded in the middle of a scan line, 
the window in which the load must be done is much smaller. 
It must be loaded after the last word of the previous scan 
line is shifted (more than 25 ns after the rising edge of 
GSCL) but at least 20 ns before the next shift clock edge. 
The arbiter/timing generator enters the display refresh cycle 
and waits at state 18 until the MIDSCAN signal is low. The 
load occurs on the rising edge of DTOE in the first half of 
state 0 after state 21. Early entry into the cycle ensures that 
the load can be made before the end of the window. The 
MIDSCAN signal holds the sequencer so that the load is not 
too early. 


The mid-scan line load feature allows contiguous mapping 
of the screen buffer as shown in Figure 9, resulting in more 
efficient use of screen buffer memory. A screen size of 480 
x 640 uses 307,200 pixels, 29.3% of the 1,048,576 pixels 
available in the VDRAM. Two buffers of this size can be 
used with 27,136 words of free memory remaining in each 
of the four planes for programs, data, character fonts and 
graphics images. In a system without mid-scan line load, 
using 640 pixels per horizontal scan line, 384 pixels (37.5%) 
of each shift register load would be unused. This scheme 
would also consume more memory bandwidth for screen 
refresh because only 62.5% of the pixels in each load are 
used, requiring the shift register to be loaded more frequent- 
ly. 

Figure 13 shows a code sequence for initializing the video 
parameters; these parameters are different than those of 
the DP850EB board. The blanking signal must be generated 
two counts earlier in this design for the mid-scan line shift 
register load circuitry. 


3.8 PROM 


The board has two PROMs, U14 and U15, which reside at 
locations OOOOOOH-0007FFH. CK2 is used to clock their 
internal registers. They hold the monitor which is executed 
by the RGP upon reset of the board. The monitor communi- 
cates with the host-resident RGP debugger through the 
communication area in VDRAM when using the parallel in- 
terface, or through the UART when using the serial inter- 
face. The monitor also uses a small area in VDRAM for 
variable storage, which must remain undisturbed if the moni- 
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tor is to be reentered. To run programs on the RGP, the 
host loads the code into VDRAM through the parallel inter- 
face, then issues a command to the monitor to branch to it. 
The details are discussed in the debugger/monitor docu- 
mentation. 


3.9 UART 


The UART (U26) is used for serial communication with the 
monitor program. U28 generates the clock for the UART. 
The line transceiver is U27. W5 allows pins 2 and 3 of the 
RS-232 connector to be switched. W6 allows pins 6 and 20 
to be switched. A serial connection is optional; the monitor 
can be accessed through the AT interface by using the 
communication area in memory. 


4.0 OTHER DESIGN ISSUES 


The pixel clock frequency could be increased to display 
higher screen resolutions. Higher resolutions may require 
more memory; the set of VDRAMs on the board will hold a 
single buffer of 1k x 1k. The RGP can directly support 
screen buffer sizes of up to 16k x 16k, though schemes can 
be used to access larger screen buffers. 


_ Memory planes can be added to increase the number of bits 


per pixel. The number of planes that can be added is not 
limited by the RGP; the rendering algorithms execute at the 
same speed, independent of the number of planes. A pail- 
lette/DAC could be added to generate analog video signals 
and allow selection of various color sets. 


lf a larger PLD was used for the main sequencer in U200, 
then ALE could be latched inside it instead of U5. This 
scheme would give more set-up time for ALE, allowing a 
slower PLD to be used. The larger PLD also could accom- 
modate additional features such as allowing the AT to initi- 
ate drawing cycles so that single pixel values could be writ- 
ten and read. 


In the AT interface circuitry, the data write latches and lower 
address latches are used as buffers. If the circuitry was al- 
tered, they could be used as latches to pipeline writes from 
the host. This would allow the host to deposit data and the 
address of the data to be written without being held during 
the entire write cycle. The host could then be obtaining the 
next word to be written while the previous word is propagat- 
ing into the RGP’s memory. The AT would be held if it at- 
tempted to write a second word before transfer of the first 
word was completed. This technique would result in a slight- 
ly more complicated host bus interface machine, but would 
reduce the host bus bandwidth spent writing to the board. 
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TrOaTNmMIOOD>D> 


000000 


DRAWING 
000101 


DISPLAY 
REFRESH 


010001 


DRAM 
REFRESH 


010000 


HOST 
ACCESS 


100001 


INSTRUCTION 
/OPERAND 


000001 


ALE1 * BS1 * BSO 
RRQ 

ALE1 * —BS1 

ALE1 

—MUXW * —MUXR 

MIDSCAN 

RRQ + —REMREO + —PB + —BS1i + BSO + —ALE1 


FIGURE 4. Flowchart for the Arbiter/Timing Generator State Machine 
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TL/F/10429-4 


INPUTS 


ABBMMMPRR 
LSSIUUBER 
EO1DXX~MQ 
1 SRW R 
Cre 
A Q 
N oe 


111XXXXXX 
11LO0XXXXX1 
110XXXxX00 
110XXXX10 
LOXXXXXX1 
1OXXXXX00 
100XXXX10 
101XXXX10 
OXXXXXXX1 
OXXXXXX00 
OXXXXXX10 
XXXXXXXXX 
XXXXXXXXX 
XXXL XXXXX 
XXXOXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXOOXXX 
XXXXO1XXX 
XXXX1XXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXX1 
XXXXXXXOX 
XXXXXXOXX 
XXOXXXXXX 
X1XXXXXXX 
OXXXXXXXX 


PPPPPP 
RSSSSS 
at ht bed hada 
M43210 
A 
C 
K 


000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
000000 
010001 
010110 
010010 
010010 
010100 
000010 
100110 
000110 
010000 
100001 
000001 
000101 
000101 
000101 
010111 
011000 
011001 
010101 
011010 
100011 
100100 
000011 
000100 
000111 
001000 
001000 
001000 
001000 
001000 
001000 


OUTPUTS 


NNNNNN 
RSSSSS 
ETTTTT 
M43210 
A 
C 
K 


010001 
010000 
100001 
000001 
010000 
100001 
000001 
000101 
010000 
100001 
000000 
010110 
010010 
010010 
010100 
010101 
000111 
100011 
000011 
010111 
100110 
000110 
001010 
000010 
000010 
011000 
011001 
011010 
000000 
000000 
100100 
100000 
000100 
000000 
001000 
000000 
000000 
000000 
000000 
000000 
000000 
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INPUTS 


101X00110 
101X01110 
101X1X110 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXX1 
XXXXXXX0X 
XXXXXXOXX 
XXO XXXXXX 
X1XXXXXXX 
OXXXXXXXX 
101X00110 
101X01110 
101X1X110 
111XXXXXX 
110XXXXX1 
110XXXX00 
110XXXX10 
1OXXXXXX1 
1OXXXXX00 
100XXXX10 
101XXXX10 
OXXXXXXX1 
OXXXXXX00 
OXXXXXX10 


001000 
001000 
001000 
001010 
001011 
001100 
001101 
001110 
001110 
001110 
001110 
001110 
001110 
001110 
001110 
001110 
100000 
100000 
100000 
100000 
100000 
100000 
100000 
100000 
100000 
100000 
100000 


OUTPUTS 


001010 
000010 
000010 
001011 
001100 
001101 
001110 
000000 
000000 
000000 
000000 
000000 
000000 
001010 
000010 
000010 
010001 
010000 
100001 
000001 
010000 
100001 
000001 
000101 
010000 
100001 
000000 


FIGURE 5. Equations for the Arbiter/Timing Generator, before Minimization 


3-91 


TL/F/10429-6 


609-NV 


AN-609 


INPUTS 


ABBMMMPRR 
LSSIUUBER 
EO01DXX~MQ 
1 SRW R 
Coc -B 
A Q 
N ~ 


XOXXXXX00 


OXXXXXX00 
XXXXXXXKXX 
XXXXXXXXX 


XXOXXXX00 


XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXX1 
111XXXXXX 


XXXXXXXXX 
XXXXXXAXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXO0OXXX 
101x00110 


XXXXXXXXX 
XXXXXXXXX 
XXXOXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
101XXXX10 


XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXX 1LXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
101XXX110 
101XXX110 


XXXXXXXXX 
XXXXXXXXX 
LXXXXXXXO0 
XXXXXXX00 
XXXXXXXXX 
XXXXXXXXX 
XXXXXXXXX 
1L11XXXXXX 


PPPPPP 
RSSSSS 


ETTTTT 


M43210 


A 


C 
K 


X00000 
X00000 
1000X1 
1001X0 
X00000 


01X00X 
01011X 
010XX0 
X00000 
X00000 


00110X 
00101xX 
01100X 
OX0111 
00X101 
001XX0 


010xX00 
01000X 
0X0010 
00001X 
X000X1 
00xX011 
00110x 
X0000X 


01000X 
01xX001 
0xX0110 
0xX0X10 
X00001 
00xX010 
X00110 
00X101 
0010X0 
00XX10 


01xX000 
010xX00 
X00000 
X00000 
001100 
00X010 
X00110 
X00000 
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OUTPUTS 


NNNNNN 
RSSSSS 
ETTTTT 
M43210 
A 
C 
K 


100000 
100000 
100000 
100000 
100000 


010000 
010000 
010000 
010000 
010000 


001000 
001000 
001000 
001000 
001000 
001000 


000100 
000100 
000100 
000100 
000100 
000100 
000100 
000100 


000010 
000010 
000010 
000010 
000010 
000010 
000010 
000010 
000010 
000010 


000001 
000001 
000001 
000001 
000001 
000001 
000001 
000001 
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FIGURE 6. Equations for the Arbiter/Timing Generator, after Minimization 


INPUTS 


RWRW 
1111 
~~DD 


RSSSSS 
ETTTTT 
M43210 


ee Oe me 


X11111 
111110 


111001 
111001 


111100 
111100 


111011 
111011 
111011 


011110 
011001 


011100 
011100 


011011 
011011 
011011 


X11010 


X11101 
X11101 


X11000 
X11000 


X10111 
X10111 
X10111 
X10111 


X10101 
X10100 


OUTPUTS 
UWDDD 
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hslt_val: equ 47 

hse_val: equ 2 

hbe_val : equ 2 

hbs_val: equ 42 

vft_val : equ 492 

vse_val: equ 2 

vbe_val : eequ ill 
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screen_warp: -equ 40 
turnonvideo: mov #Stop_ver, ver 


mov #nslt_val, video 
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mov #nbS.val, video 
mov #vft.val, video 
mov #vse.val, video 
mov #vbe.val, video 
mov #vbS_val, video 
mov #Sscreen_base, dbb 
mov #sScreen_warp, dbwrp 
mov #run.ver, ver 


ret 
FIGURE 13. Instructions for Setting Video Parameters 
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M5M4C264L 
j 
> 
nee 
O 
16V8_25 
LD~ ; 
A | 
2 VD3 ~ 
<j 
Ws . 
OR 74LS195A CENTERED IN SOCKET | if 
VA 20 pP8511V apr 
a /) aa ii 
VAI 22 
VAG 25 470_0HM 
RB 
QO 
> 


VAi 22 
VAO 25 


WRe Sic 
© 
SDIOE 10 


A RAS~ 14 


¢, 


SEQ~ 3 


TL/F/10429-24 


FIGURE 14m 
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MODULE U100R 

TITLE 

‘video Shift Register. U110, U120, U130 are same. 74LS195A 
equiv. Device:U100 Filename:U100R 

Jim Margeson 

Date: 12/15/88 

Revision: 2.0! 


609-NV 


U1O0ORJ device ‘pl6v8r'; 


CK1C pin Le 
NC2 pin 2; 
NC3 pin 37 
NC4 pin 4; 
PNA3 pin 37 
PNA2 pin 6; 
PNA1 pin 73 
PNAO pin 8; 
NC9 pin 9; 
GND pin 10; 
OE~ pin 11; 
LD~ pin 12; 
NC13 pin ple 
o5 pin 14; 
VDO pin 133 
04 pin 16; 
03 pin LT; 
NC18 pin 18; 
NC19 pin 19; 
vcc pin 20; 
Equations 


!vD0 := !LD~ & !PNAO # LD~ & !05; 
05 := !LD~ & !PNAl # LD~ & !04; 
104 := !LD~ & !PNA2 # LD~ & !03; 
{03 := !LD~ & !PNA3; 


TL/F/10429-25 
FIGURE 15. Equations for U100 
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MODULE U200R 

TITLE 

‘Main Bus Sequencer (from B5) 
Device:U200 Filename:U200R 
Jim Margeson 

Date: 12/15/88 

Revision: 2.0' 


U200RJI device 'p20v8r'; 


CK1A pin 1% 
ALE1 pin 2; 
BS0 pin 3; 
BS1 pin 4; 
MIDSCAN pin 5; 
MUXR~ pin 6; 
MUXW~ pin 12 
PB~ pin 8; 
REMREQ~ pin 9; 
RRO pin 10; 
RSTI~ pin 11; 
GND pin 12; 
OE~ pin 13; 
DRREQ~ pin 14; 
REMACK pin 15; 
REMACK~ pin 16; 
ST4~ pin 17; 
ST3~ pin 18; 
ST2~ pin 19; 
ST1~ pin 20; 
ST0~ pin 21; 
FBST1~ pin 22; 
NC23 pin 23; 
vcc pin 24; 
Equations 


REMACK := RSTI~&!BS0&! REMREQ~& !RRQE&ST4~&ST3~&ST2~&ST1~&STO~ 
# RSTI~&! ALE1& ! REMREQ~& ! RRQ&ST4~&ST3~&ST2~EST1~&STO~ 
# RSTI~& ! REMACK~&ST4~&ST3~&ST2~&! STO~ 
¥ RSTI~&! REMACK~&ST4~&ST3~&!ST2~&STO0~ 
# RSTI~&!BS1&! REMREQ~& ! RRO&ST4~&ST3~&ST2~&ST1~E&STO~; 


!'REMACK~ := RSTI~&!BS0&! REMREQ~&! RRQO&ST4~&ST3~&ST2~&ST1~&STO~ 
# RSTI~& !ALE1&! REMREQ~& ! RRQ&ST4~&ST3~&ST2~&ST1~&STO~ 
RSTI~é& ! REMACK~&ST4~&ST3~&ST2~&!STO~ 
# RSTI~& ! REMACK~&ST4~&ST3~&!ST2~&STO~ 
# RSTI~&!BS1&! REMREQ~& !RRQ&ST4~&ST3~&ST2~&ST1~&STO~; 


'ST4~ := RSTI~&REMACK~&!ST4~&ST2~&ST1~ 
# RSTI~&REMACK~&!ST4~&ST3~&!ST2~&!ST1~ 
# RSTI~&REMACK~&!ST4~&ST3~&STO~ 
# RSTI~&RRQEST4~&ST3~&ST2~E&ST1I~&STO~ 
# RSTI~&ALE1L&BSOS&BS1&ST4~E&ST3~&ST2~&ST1~&STO~; 


'ST3~ := RSTI~&REMACK~&ST4~&!ST3~&!ST2~&ST1~ 


TL/F/10429-26 


FIGURE 16. Equations for U200 
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RSTI~&REMACK~&ST4~&!ST3~&ST2~&!ST1~ 

RSTI~&REMACK~&!ST4~&!ST3~&ST2~&ST1~ 

RSTI~&REMACK~&ST3~&!ST2~&!ST1~&!STO~ 

RSTI~& !MUXR~& !MUXW~&REMACK~&ST4~&!ST2~&ST1~&!STO~ 

RSTI~&ALE1&!BSO0&BS1& ! MUXR~& ! MUXW~&PB~&REMREQ~& ! RRQ&REMACK~ 
&ST4~&!ST3~&ST0~; 


609-NV 


isTt2~ RSTI~&REMACK~& !ST4~&ST3~&ST1~&STO~ 
RSTI~&REMACK~&!ST4~&ST3~&ST2~&ST1~ 

RSTI~& !MIDSCAN& REMACK~&ST3~&ST2~&!ST1~&STO~ 
RSTI~&REMACK~&ST4~&ST3~&ST2~&!ST1~ 
RSTI~&ST4~&ST3~&ST2~&!STO~ 
RSTI~&REMACK~&ST4~&ST2~&!ST1~&!STO~ 
RSTI~&REMACK~&ST4~&!ST3~&!ST2~&ST1~ 


RSTI~&ALE1& !BSO0&BS1&REMREQ~& ! RRQ&ST4~&ST3~&ST2~&ST1~; 


he He He He 


'ST1~ := RSTI~&ALE1& ! BSO&BS1&PB~&REMREQ~& ! RRQ& REMACK~ 
&ST4~&!ST3~&ST2~&STO~ 

RSTI~&ALE1& !BS0&BS1&PB~&REMREQ~& ! RRQ&REMACK~&ST4~& !ST1~&STO~ 

RSTI~&MIDSCAN&REMACK~&ST3~&! ST1~&STO~ 

RSTI~&REMACK~&! ST4~&ST3~&ST2~&ST1~ 

RSTI~&REMACK~& ! ST4~&ST2~&ST1~&!STO~ 

RSTI~&REMACK~&ST3~&!ST2~&!ST1~&STO~ 

RSTI~&ST4~&ST3~&ST2~&ST1~&!STO~ 

RSTI~&!FBST1~; 


ae ie ee Re OE MK Me 


tFBST1~ = RSTI~&REMACK~&ST4~&ST2~&!ST1~&STO~ 
# RSTI~&ST4~&ST3~&!ST2~&!ST1~&STO~ 
# RSTI~&REMACK~&ST4~&!ST2~&ST1~&!STO~; 


!STo~ 


RSTI~&REMACK~& !ST4~&ST2~&ST1~&STO~ 
RSTI~&REMACK~&! ST4~&ST3~&ST1~&STO~ 
RSTI~&ALE1& ! RRQEST4~&ST3~&ST2~&ST1~&STO0~ 
RSTI~& ! REMREQ~& ! RRQ&ST4~&ST3~&ST2~&ST1~&STO~ 
RSTI~&REMACK~&ST4~&!ST3~&!ST2~&ST1~&STO~ 
RSTI~&REMACK~&ST4~&ST2~6!ST1~&STO~ 
RSTI~&ST4~&ST3~&!ST2Z2~&!ST1~&STO~ 
RSTI~&ALEL&BSO&BS1EST4~&ST3~&ST2~&ST1~&STO~; 


me He ie Me Me ae ae 0 


End; 


TL/F/10429-27 
FIGURE 16. Equations for U200 (Continued) 
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'WR~ 


MODULE U201R 


TITLE 


‘Control Signal Generator #1 (from B5-3) 
Device:U201 Filename:U201R 


J. Margeson. 


Date: 12/15/88 


Revision: 


20" 


U201RJI device 'p20v8r'; 


CK1A 
sT0~ 
ST1~ 
ST2~ 
ST3~ 
ST4~ 
REMACK~ 
MUXR~ 
MUXW~ 
MUXWD ~ 
MUXRD~ 
GND 

OE~ 
NC14 
UARTWR~ 
WR~ 
DOE~ 
NCASIN~ 
RAS~ 
DTOE~ 
DLE~ 
ST634 
NC23 
vce 


Equations 


{UARTWR~ 


pin a 
pin Zi 
pin 3; 
pin 4; 
pin 33 
pin 6; 
pin 73 
pin 8; 
pin 9; 
pin 10; 
pin 11; 
pin 12; 
pin 13; 
pin 14; 
pin 3 Oe 
pin 16; 
pin pea 
pin 18; 
pin 19; 
pin 20; 
pin 21; 
pin Ze; 
pin 23; 
pin 24; 


>= MUXR~&!MUXW~& ST4~& ST3~&!ST2~&!ST1~& STO~ 


# MUXR~&!MUXW~& ST4~& ST3~& ST2~&!ST1~&!STO~; 


# 
# 


!DTOE~ 


'DLE~ 


ae ate aE AR Ae OK te 


!MUXWD~&ST4~&!ST3~&ST2~&ST1~ 
'ST3~&!ST2~&!ST1~ 
MUXR~& !MUXW~&ST4~&ST3~6!ST2~&ST1~&ST0~; 


!'MUXR~&!ST2~&ST1~&STO~ 

'ST3~&!ST2~&ST1~&STO~ 
'ST4~&ST3~&ST2~&!STO~ 
'ST4~&ST3~&ST2~&!ST1~ 
MUXWD~&ST4~&!ST3~&ST1~&STO~ 
'MUXR~&ST4~&!ST1~&!STO~ 
1ST3~&!ST1~&!STO~ 
'ST4~&!ST2~&STO0~; 


!ST3~&!ST2~&ST1~&STO~ 
!MUXRD~&MUXWD~&ST4~&!ST3~&ST1~&STO~; 


FIGURE 17. Equations for U201 
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!DOE~ := !MUXW~&ST4~&ST3~&ST2~&!ST1~&STO~ 
# !MUXWD~&ST4~& !ST3~&ST2~&ST1~ 
# 'ST3~&!ST2~&!STO~ 
# 'ST3~&!ST2~&!ST1~ 
# MUXR~&ST4~6&!ST2~&!ST1~&!STO~; 
'RAS~ := !ST4~&!ST2~&ST1~ 


# !'ST2~&STO0~ 

# ST4~&!ST1~ 

# ST3~&!ST1~&STO~ 
# !ST3~&ST1~; 


'NCASIN~ := ST3~&!ST1~ 
# ST4~&ST2~&!ST1~ 
# 'ST3~&!ST2~EST1~ 
# 'ST4~&ST1~&STO~; 


!ST634 := !ST2~&!STO~ 


# !ST2~&ST1~ 
# 'sT3~ 
# 'sSsT4~ 
# ST2~&STO~; 


End; 
TL/F/10429~-29 


FIGURE 17. Equations for U201 (Continued) 
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MODULE U202R 


TITLE 


"Control Signal Generator #2 (from B5-3) 
Device:U202 Filename: U202R 

Jim Margeson 

Date: 12/15/88 


Revision: 


2.0! 


U202RJ device 'p20v8r'; 


CK1A 
sT0~ 
ST1l~ 
ST2~ 
ST3~ 
ST4~ 
REMACK~ 
MUXR~ 
MUXW~ 
RSTI~ 
NC11 
GND 


Equations 


!DDIN := 


# 


pin 1; 
pin 2; 
pin 3; 
pin 4; 
pin 33 
pin 6; 
pin ; 
pin 8; 
pin 9; 
pin 10; 
pin ll; 
pin 12; 
pin 13; 
pin 14; 
pin 15; 
pin 16; 
pin 17; 
pin 18; 
pin 19; 
pin 20; 
pin 21; 
pin 22; 
pin 23; 
pin 24; 


!MUXR~&ST4~&ST3~&!ST2~ESTO~ 
!MUXR~&ST3~&ST2~&!ST1~E!ST0~; 


'DBE~ := REMACK~&ST4~&ST3~&!ST2~&ST0~ 


# 


'RDY : 


'RSTA 


!RSTRRQ~ 


LE~ 
# 
# 
# 


REMACK~&ST3~&ST2~&!ST1~&!STO~; 


'ST3~&!ST1~ 
'ST4~&!ST1~ 
$T2~&ST1~ 
ST1~&!STO~ 
!'REMACK~& !ST0~ 
ST4~&ST3~&!ST2~; 


:= REMACK~&ST3~&ST2~&ST1~&! STO~ 
1ST4~&ST3~&ST2~&!ST0~ 
ST4~&ST2~&!ST1~&STO~ 

!'RSTI~; | 


:= !ST4~&ST3~&ST2~6ST1~&ST0~ 


TL/F/10429-30 
FIGURE 18. Equations for U202 
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End; 


# 


!RSTI~; 
ST4~; 


!SsTt4a~ 

!ST3~ 
'ST2~6&!ST0~ 
ST1~&!STO~ 
ST2~&STO~; 


STO~ 
STl~ 
ST2~ 
!ST4~; 


TL/F/10429-31 
FIGURE 18. Equations for U202 (Continued) 
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MODULE U203R 

TITLE 

"CAS 

Device:U203 Filename:U203R 
Jim Margeson 

Date: 12/15/88 

Revision: 2.0' 


U203RJ device 'p20v8r'; 


CK1D pin 1; 
A23 pin 23 
A22 pin 3; 
A21 pin 4; 
A18 pin 53 
Al7 pin 6; 
WMASKO pin V3 
WMASK1 pin 8; 
WMASK2 pin 9; 
WMASK3 pin 10; 
ST8 pin 11; 
GND pin 12; 
OE~ pin 13; 
RFIP pin 14; 
MUXR~ pin 15; 
NC16 pin 16; 
CAS0~ pin 17; 
CAS1~ pin 18; 
CAS2~ pin 19; 
CAS3~ pin 20; 
A20 pin 21; 
LADDR~ pin 22; 
NCASIN~ pin 23; 
vcc pin 24; 
Equations 


'CAS3~ := !LADDR~ & !A23 & A22 & Al8 & Al7 & !NCASIN~ & !ST8 
# LADDR~ & WMASK3 & !NCASIN~ & !ST8 
# RFIP & !NCASIN~ & !ST8 
# ST8 & !CAS3~; 


'CAS2~ := !LADDR~ & !A23 & A22 & A18 & '!Al17 & '!NCASIN~ & !ST8 
# LADDR~ & WMASK2 & !NCASIN~ & !STB8 
# RFIP & !NCASIN~ & !ST8 
# ST8 & !CAS2~; 


'CAS1~ := !LADDR~ & !A23 & A22 & !A18 & Al7 & !NCASIN~ & !ST8 
# LADDR~ & WMASK1 & !NCASIN~ & !ST8 
# RFIP & !NCASIN~ & !ST8 
# ST8 & !CAS1~; 


'CASO~ := !LADDR~ & !A23 & A22 & !A18 & 'A17 & !NCASIN~ & !ST8 
# LADDR~ & WMASKO & !NCASIN~ & !ST8 
# RFIP & !NCASIN~ & !ST8 
# ST8 & !CASO~; 


End; 


FIGURE 19. Equations for U203 
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MODULE U204R 


TITLE 


"Device Enables 


Device:U204 


Jim Margeson 


Date: 


Revision: 


12/15/88 
2.0! 


U204RJ device 'pl6v8c'; 


IOEN 
A23 
A22 

A21 

A20 

A3 

A2 

Al 

A0 

GND 
MUXW~ 
FSE3~ 
FSE2~ 
FSE1~ 
FSEO~ 
CRE~ 
ENUART~ 
ENP ROM~ 
LWMASK 
vcc 


Equations 


1 FSE3~ 
'FSE2~ 
'FSE1~ 
!'FSEQ~ 
!CRE~ 


! ENUART~ 
! ENPROM~ 


! LWMASK 


End; 


pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 


A23 
A23 
A23 
A23 
A23 
A23 
'A23 
1A23 


Me MM MM KM KM 


TOM MO MM MM 


Filename:U204R 


A21 
A21 
A21 
A21 
'A21 
'A21 
1A21 
!A21 


TOM Mm MM OM KM 


'A20 
'A20 
'A20 
'A20 
'A20 
'A20 
'A20 

A20 


e 


IOEN; 

IOEN; 

IOEN; 

IOEN; 

tA2 & !Al & AO & IOEN; 
IOEN; 

IOEN & MUXW~; 

IOEN # MUXW~; 


FIGURE 20. Equations for U204 
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MODULE U205R 

TITLE 

'BE 

Device:U205 Filename 
Jim Margeson 

Date: 12/15/88 
Revision: 2.0' 


U205RJ device 'p20v8r 


CK1A pin 1; 
ST8 pin 2; 
ST634 pin 3; 
A23 pin 4; 
A22 pin 37 
A21 pin 6; 
A20 pin Ts 
Als pin 8; 
A17 pin 9; 
A3 pin 10; 
A2 pin 11; 
GND pin LZ; 
OE~ pin 13; 
Al pin 14; 
DRCYCLE~ pin 15; 
INTPL pin 16; 
BE3~ pin 17; 
BE2~ pin 18; 
BE1~ pin 19; 
BEO~ pin 20; 
NC21 pin 21; 
BSE pin 22; 
AO pin 23; 
VCC pin 24; 
Equations 


:U205R 


'BE3~ := ST634 & !A23 & A22 & Al8 & Al7 


# ST634 & A23 & !A22 & !A21 & !A3 & 1A2 & 


# ST634 & A23 & 
# INTPL & BSE & 
# ST8 & !BE3~; 


!A22 & A21 &A3 
!DRCYCLE~ & !ST8 


{BE2~ := ST634 & !A23 & A22 & A18 & !A17 


# ST634 & A23 & 
# ST634 & A23 & 
# INTPL & BSE & 
# ST8 & !BE2~; 


'A22 & !A21 & !A3 & 
'A22 & A21l & A2 
'DRCYCLE~ & !ST8 


'BE1~ := ST634 & !A23 & A22 & !A18 & Al7 


# ST634 & A23 & 
# ST634 & A23 & 
# INTPL & BSE & 
# ST8 & !BEl~; 


'A22 & !A21 & !A3 & 
'A22 & A21l & Al 
'DRCYCLE~ & !ST8 


'BEO~ := ST634 & !A23 & A22-& !A18 & !A17 


# ST634 & A23 & 
# ST634 & A23 & 
# INTPL & BSE & 
# ST8 & !BEO~; 


End; 


'A22 & !A21 & !A3 & 
'A22 & A21 & AO 
'DRCYCLE~ & !ST8 


'A2 & 


'A2 & 


'A2 & 


'Al & AO 


'Al & AO 


!Al & AO 


'Al & AQ 


FIGURE 21. Equations for U205 
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MODULE U20 
TITLE 

] 
Device:U206 
Jim Margeso 
Date: 12/15 
Revision: 2 


6R 


Filename :U206R 


n 
/88 
.0' 


U206RI device 'p20v8r'; 


CK1D 
MUXR~ 
IOEN 
A23 


RGPINTRES~ 
ATINTWR~ 
AO 

vcc 


Equations 


uu 
HoH 
ww 
N WwW 
1 | 


4°) 
0 
Et 
re 
2 
il 


ro 
O 
| 
2 
I 


!'RGPINTRES~ 
'ATINTWR~ 


End; 


pin 13 
pin 2; 
pin 3; 
pin 4; 
pin 33 
pin 6; 
pin 73 
pin 8; 
pin 9; 
pin 10; 
pin 11; 
pin 12; 
pin 13; 
pin 14; 
pin 15; 
pin 16; 
pin 17; 
pin 18; 
pin 19; 
pin 20; 
pin 21; 
pin 22; 
pin 23; 
pin 24; 


MUXR~ # A23 & !A22 # !A18 # !A17; 
MUXR~ # A23 & !A22 # !A18 # Al17; 
MUXR~ # A23 & !A22 # A118 # !A17; 
MUXR~ # A23 & 'A22 # A18 # A177; 


A23 & !A22 & 'A21 & !A20 & !A3 & !A2 & Al & !AO & IOEN; 


A23 & !A22 & 'A21 & '!A20 & '!A3 & A2 & !Al & !A0 & IOEN; 


= A23 & 


A23 & 


A22 & 'A2l1 & A20 & IOEN & MUXR~; 


A22 & A21 & A20 & IOEN & MUXR~; 


FIGURE 22. Equations for U206 
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MODULE U207R 

TITLE 

‘MIDSCAN, BLANKD~ GENERATOR 
Device:U207 Filename:U207R 
Jim Margeson 

Date: 12/15/88 

Revision: 2.0! 


U2Z07RJ device 'pl6v8r'; 


LCK pin ni 
BLANK~ pin 2; 
DRREQ~ pin 33 
RST~ pin 4; 
L_DR_REQ~ pin 33 
I5 pin 6; 
I6 pin a3 
17 pin 8; 
I8 pin 9; 
GND pin 10; 
19 pin 11; 


DR_REQ pin 12; 
RST_REQ~ pin 13; 


DD pin 14; 
F3 pin 15; 
D_BLANK~ pin 16; 
FS pin 17; 
F6 pin 18; 
MISCAN pin 19; 
vcc pin 20; 
Equations 


DR_REQ = !DRREQ~; 

F3 := BLANK~; 

DD = D_BLANK~; 

D_BLANK~ := F3; 

RST_REQ~ = RST~ & !F6; 

FS := !L_DR_REQ~ & IFS; 

F6 := !L_DR_REQ~ & F5 & !F6; 
MISCAN = !F6 & D_BLANK~; 


End; 
TL/F/10429-—36 
FIGURE 23. Equations for U207 
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MODULE U208R 
TITLE 

‘DRAM REFRESH 
Device:U208 
Jim Margeson 
Date: 12/15/88 
Revision: 2.0! 


REQUEST, GSC, 
Filename:U208R 


READ + WRITE MUX 


U208RJ device 'pl6v8r'; 
LCKL1 pin 1; 
LCKL pin 2; 
BLANKD~ pin 3: 
NC4 pin 4; 
R~ pin 33 
W~ pin 6; 
REMRL pin 7; 
REMWL pin 8; 
REMACK pin 9; 
GND pin 10; 
OE~ pin Li; 
GSC pin iz; 
06 pin 13; 
O5 pin 14; 
04 pin 15; 
03 pin 16; 
VRRQ pin 17; 
MUXR~ pin 18; 
MUXW~ pin 19; 
vcc pin 20; 
Equations 
'VRRQ := !VRRQ & O03 & O04 & O5 
# 'VRRQ & O03 & O04 & O06 
# VRRQ & !03 & !04 & !05 & !06; 
103 := VRRQ & !03 & O5 
# VRRQ & !03 & 04 
# VRRQ & !03 & 06 
# VRRQ & O03 & !04 & !05 & !06; 
104 >= VRRQ & !04 & 06 
# VRRO & !04 & O5 
# VRRQ & 04 & !05 & !06; 
105 >= VRRQ & O5 & !06 
# VRRQ & !05 & O06 


# 03 & 04 & !05 & 06 


# 03 & 04 & OS & !06; 
106 >= O3 & O04 & O06 

# VRRQ & O6; 
{GSC = !BLANKD~ # !LCKL; 
'MUXW~ = !W~ & !'REMACK 

# REMWL & REMACK; 
'MUXR~ = !'R~ & !REMACK 

# REMRL & REMACK; 
End; 


FIGURE 24. Equations for U208 
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MODULE U209R 

TITLE : 

q 

Device:U209 Filename:U209R 
Jim Margeson 

Date: 12/15/88 

Revision: 2.0' 


U209R device 'pl6év8r'; 


LCKO pin as 
DDBSE pin 23 
INTPL pin So: 
BS1 pin 4; 
BSO : pin ; 
REMACK pin 6; 
ATINTRES~ pin a3 
ATINTWR~ pin 8; 
DO pin 9; 
GND pin 10; 
OE~ pin 11; 
DRCYCLE~ pin 12; 
LADDR~ pin 13; 
LCK pin 14; 
LCKOD4 pin 15; 
LCKOD2 pin 16; 
RSTI~ pin 17; 
ATINT~ pin 18; 
NC19 pin 19; 
Vcc pin 20; 
Equations 


!1LCKOD2 := LCKOD2; 


{LCKOD4 := !LCKOD4 & LCKOD2 
# LCKOD4 & !LCKOD2; 


'LCK := !LCK & LCKOD4 
# !LCK & LCKOD2 
# 4LCK & !LCKOD4 & !LCKOD2; 
!DRCYCLE~ = BS1 & !BSO & !REMACK; 
! LADDR~ = DDBSE & INTPL & BS1 & !BSO 
# !BS1 
# REMACK; 
'ATINT~ = DO & !ATINTWR~ & RSTI~ 
# DO & !ATINT~ & ATINTRES~ & RSTI~ 
# ATINIWR~ & !ATINT~ & ATINTRES~ & RSTI~; 


End; 


FIGURE 25. Equations for U209 
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~TL/F/10420-98 


MODULE U210R 

TITLE 

'SE~, LD~ 

Device:U210 Filename:U210R 
Jim Margeson 

Date: 12/15/88 

Revision: 2.0' 


U210RJ device 'pl6v8r'; 


CK1C pin 1; 
GSC pin 2; 
BLANKD~ pin 3; 
NC 4 pin 4; 
NCS pin 37 
NC6 pin 6; 
NC7 pin a3 
NCB pin 8; 
NC9 pin 9; 
GND pin 10; 
OE~ pin 11; 
SE3~ pin 12; 
SE2~ pin 13; 
LD~ pin 14; 
ro}s) pin 15; 
04 pin 16; 
03 pin Ls 
SE1~ pin 18; 
SEO~ pin 19; 
vcc pin 20; 
Equations 

'SEQO~ = O3 & GSC; 
!SE1l~ = !03 & GSC; 
'SE2~ = O3 & !GSC; 
!'SE3~ = !03 & !GSC; 
03 = !03 & O5 & BLANKD~ 


# !03 & O4 & BLANKD~ 
# O3 & !04 & !05 & BLANKD~; 


104 >= O05 & !'04 & BLANKD~ 
# !05 & O04 & BLANKD~; 

£05 >= OS & BLANKD~; 

!'LD~ := O04 & OS; 

End; 


FIGURE 26. Equations for U210 
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TL/F/10429-39 


609-NV 


AN-609 


MODULE U211R 


TITLE 


"latch and blank video data 
Device:U211 
Jim Margeson 
Date: 12/15/88 


Revision: 


2.0' 


Filename:U211R 


U211RJ device 'plév8r'; 


BLNKDLL 


Equations 


!GSCL 

! BLANKDL~ 
! BLNKDLL 
! BLNKDLLL 


!v0 
!vl1 
!v2 
'v3 


End; 


pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 


ee 80 608 880 
a 


!vD0 # BLNKDLLL; 
'VD1 # BLNKDLLL; 
!VD2 # BLNKDLLL; 
!'VD3 # BLNKDLLL; 


FIGURE 27. Equations for U211 
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TL/F/10429-—40 


MODULE U220R1 

TITLE 

"Host Address Decode and 16-bit Transfer Request 
Device:U220 Filename :U220R1 

Jim Margeson 

Date: 1/23/88 

Revision: 2.01' 


U220RJ1 device 'pl6v8c'; 


AEN pin 1; 
REMLA2 3 pin 23 
REMLA22 pin 3; 
REMLA21 pin 4; 
REMLA2 0 pin 2 
REMLA19 pin 7 
REMLA18 pin 7; 
REMLA17 pin 8; 
REMA16 pin 9; 
GND pin 10; 
BALE pin a1? 
NC12 pin 12; 
BDSELL~ pin 13; 
ASOA~ pin 14; 
AS0D~ pin 15; 
ASAO~ pin 16; 
M160A~ pin L7; 
M160D~ pin 18; 
M16A0~ pin 19; 
vcc pin 20; 
Equations 

'M160A~ = 'ASOA~ # ASOA~ # !ASOD~ # ASOD~ # 


ENABLE M160A~ 'REMLA23 & !REMLA22 & !REMLA21 & 


& !REMLA18 & REMLAI7 & 'ASOA~; 


'M160D~ , = !ASOA~ # ASOA~ # !ASOD~ # ASOD~ # !ASAQO~ # ASAO~; 
ENABLE M160D~ = !REMLA23 & !REMLA22 & !REMLA21 & !REMLA20 & REMLAI19 
& REMLA18 & !REMLAI7 & 'ASOD~; 
!M16A0~ ~ ‘ASOA~ # ASOA~ # !ASOD~ # ASOD~ # !ASAO~ # ASAO~; 
ENABLE M16A0~ = REMLAZ3 & !REMLA22 & REMLA21 & !REMLA20 & !REMLA19 
& !REMLA18 & !REMLA17 & !ASA0~; 
!BDSELL~ = !AEN & !REMLA23 & !REMLA22 & !REMLA21 & !REMLA20 & REMLA19 
& !REMLA18 & REMLAI7 & !REMAI6 & !ASOA~ & BALE 
# !'AEN & !REMLA23 & !REMLA22 & !REMLA21 & !REMLA20 & REMLA19 
& !REMLA18 & REMLA17 & !REMA16 & '!ASOA~ & !BDSELL~ 
# 'AEN & !REMLA23 & !REMLA22 & !REMLA21 & !REMLA20 & REMLA19 
& REMLA18 & !REMLA17 & REMA16 & !ASOD~ & BALE 
# !AEN & !REMLA23 & !REMLA22 & !REMLA21 & !REMLA20 & REMLA19 
& REMLA18 & !REMLA17 & REMA16 & !ASOD~ & !BDSELL~ 
# 'AEN & REMLA23 & !REMLA22 & REMLA21 & !REMLA20 & !REMLA19 
& !REMLA18 & !REMLAI17 & !ASA0O~ & BALE 
# !AEN & REMLA23 & !REMLA22 & REMLA21 & !REMLA20 & !REMLA19 
& !'REMLA18 & !REMLA17 & '!ASAO~ & !BDSELL~ 


# !BALE & !BDSELL~; 


End; 


FIGURE 28. Equations for U220 
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!ASAO~ # ASAO~; 
'REMLA2O & 


REMLA19 


TL/F/10429-41 


609-NV 


AN-609 


MODULE U221R1 


TITLE 

‘Host Interface Sequencer 
Device:U221 Filename:U221R1 
Jim Margeson 

Date: 2/4/88 

Revision: 2.01' 


U221RJ1 device 'plév8r'; 


CK1B pin is 
BDSELL~ pin 23 
REMA15 pin 33 
REMACK~ pin 4; 
REMA14 pin 57 
REMR~ pin 6; 
REMW~ pin 73 
RSTI~ pin 8; 
NC9 pin 9; 
GND pin 10; 
OE~ pin 11; 
REMDG~ pin 12; 
REMACKD pin 13; 
RDYHI~ pin 14; 
REMREQ~ pin 15; 
REMRL pin 16; 
REMWL pin 17; 
NC18 pin 18; 
NC19 pin 19; 
vcc pin 20; 
Equations 
!REMRL 7= REMR~ # BDSELL~ # !RSTI~; 
!REMWL := REMW~ # BDSELL~ # !RSTI~; 
'REMREQ~ := !BDSELL~& !REMA15 & REMWL & !REMACKD & RDYHI~ & RSTI~ 
# !'BDSELL~ & REMA15 & !REMA14 & REMWL & !REMACKD & RDYHI~ & RSTI~ 
# !BDSELL~ & !REMA15 & REMRL & !REMACKD & RDYHI~ & RSTI~ 
# !BDSELL~ & REMA15 & !REMA14 & REMRL & !REMACKD & RDYHI~ & RSTI~; 
'REMACKD := REMACK~ # BDSELL~ # !RSTI~; 
'RDYHI~ := !BDSELL~ & !REMA15 & REMREQ~ & REMACK~ & REMACKD & RDYHI~ 
& RSTI~ 
# !BDSELL~ & REMAI15 & !REMA14 & REMREQ~ & REMACK~ & REMACKD 
& RDYHI~ & RSTI~ — 
# REMWL & !RDYHI~ & RSTI~ 
# REMRL & !RDYHI~ & RSTI~; 
! REMDG~ = !REMACK~ & REMWL & RSTI~ | 
# ‘!BDSELL~ & !REMAIS & !'REMR~ & RSTI~ 
# 


End; 


!'BDSELL~ & REMA15 & !REMA14 & !REMR~ & RSTI~; 


FIGURE 29. Equations for U221. 
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TL/F/10429-42 


MODULE U222R 


TITLE 


Device:U222 


Jim Margeson 
12/15/88 


Date: 


Revision: 


2.0' 


Filename:U222R 


U222RJ device 'p20v8r'; 


609-NV 


CK1B pin Z3 
CREGEN~ pin 2; 
REMA1 3 pin 37 
REMA12 pin 4; 
REMA11 pin S; 
REMRST pin 6; 
PBRST pin 73 
PBNMI pin 8; 
UARTINT pin ; 
REMD0 pin 10; 
RGPINTRES~ pin 11; 
GND pin 12; 
OE~ pin 13; 
REMR~ pin 14; 
STATROC~ pin iS: 
AREGC pin 16; 
AREGROC~ pin 17; 
INT~ pin 18; 
NMI ~ pin 19; 
SRSTL~ pin 20; 
RSTI~ pin 21; 
ATINTRES~ pin 22; 
REMW~ pin 23% 
vcc pin 24; 
Equations 
!'SRSTL~ = REMDO & !CREGEN~ & REMA13 & !REMA]2 & !REMAI11 & !REMW~ 
& !REMRST & !PBRST 
# REMDO & !SRSTL~ & !REMRST & !PBRST 
# CREGEN~ & !SRSTL~ & !REMRST & !PBRST 
# !'REMAI13 & !SRSTL~ & !REMRST & !PBRST 
# REMA12 & !SRSTL~ & !REMRST & !PBRST 
# REMA11 & !SRSTL~ & !REMRST & !PBRST 
# REMW~ & !SRSTL~ & !REMRST & !PBRST; 
'RSTI~ := !SRSTL~ # REMRST # PBRST; 
!INT~ = REMDO & !CREGEN~ & REMA13 & !REMA12 & REMA11 & !REMW~ 
& RGPINTRES~ & RSTI~ 
# REMDO & !INT~ & RGPINTRES~ & RSTI~ 
# CREGEN~ & !INT~ & RGPINTRES~ & RSTI~ 
# !REMA13 & !INT~ & RGPINTRES~ & RSTI~ 
# REMA12 & !INT~ & RGPINTRES~ & RSTI~ 
# !REMA11 & !INT~ & RGPINTRES~ & RSTI~ 
# REMW~ ¢& !INT~ & RGPINTRES~ & RSTI~; 
!'NMI~ = PBNMI # !CREGEN~ & !REMAL3 & {REMA12 & REMA11 & !REMW~; 
! AREGC = CREGEN~ # REMA13 # REMA12 # REMAI1 # REMW~; 
'AREGROC~ = !CREGEN~ & !REMA13 & !REMA12 & !REMA11 & !REMR~; 
{STATROC~ = !CREGEN~ & !REMAL3 & REMA12 & REMA11 & !REMR~; 
!ATINTRES~ = !CREGEN~ & !REMA13 & REMA12 & !REMA11 & !REMW~ # !RSTI~; 
End; 


FIGURE 30. Equations for U222 
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TL/F/10429~43 


AN-609 


MODULE U223R 


TITLE 


Device:U223 


Filename:U223R 


Jim Margeson 
Date: 12/15/88 


Revision: 


2<0" 


U223RJ device 'plév8c'; 


NC1 
BDSELL~ 
REMA15 
REMA1 4 


Equations 


!WAIT~ = 


# 
# 
# 


! REMRDY 
ENABLE REMRDY 


!MSOC~ 


!'FSOC~ 


! CREGEN~ 


'A13 


ENABLE A13 


End; 


pin 1; 
pin 2; 
pin 37 
pin 4; 
pin 53 
pin 6; 
pin 73 
pin 8; 
pin 9; 
pin 10; 
pin ii: 
pin 12; 
pin 13; 
pin 14; 
pin 15; 
pin 16; 
pin LT; 
pin 18; 
pin 19; 
pin 20; 


{BDSELL~ & !REMA15 & RDYHI~ & 
'BDSELL~ & !REMA1S & RDYHI~ & 
!BDSELL~ & REMA15 & !REMA14 & RDYHI~ & 
!BDSELL~ & REMA15 & !REMA14 & RDYHI~ & 


!BDSELL~ # BDSELL~; 
'WAIT~; 


tl 


!REMACK~ & !REMA15; 
= !REMACK~ & REMA15 & !REMA14; 
= !BDSELL~ & REMA15 & REMA14; 


= !REMA14 & !REMA1S; 
= !REMACK~; 


FIGURE 31. Equations for U223 
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!REMW~ 
!'REMR~ 
! REMW~ 


!'REMR~ 


& RSTI~ 
& RSTI~ 
& RSTI~ 


& RSTI~; 


TL/F/10429-44 


QUANTITIES ARE FOR 1 BOARD 
PARTS FOR WAVE SOLDER: 
Ll: 14=PIN SOCKETS 


U5 
U6 
U28 
U225 


23 16=-PIN SOCKETS 


Ul2 
ULS 
U27 
RN1 


33 20=-PIN SOCKETS 


U25 
US 
Ull 
U29 
ug 


U1LO 

U102 
U103 
U1ll12 
U1135 


U1L22 
U123 
U132 
U135 
U7 


56 


5% 
Ul 
63 
U2 
U26 
U1Ol 
U111 
Ul21 
U131 


73 


'U104 


FIGURE 32. Parts List 
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U105 
U106 
U107 


U114 
U115 
Ull6 
U117 


U124 
U125 
U126 
U127 


U134 
U135 
U1356 
U137 


24-PIN DIP SOCKETS 


68=PIN PLCC SOCKET 


44-PIN PLCC SOCKET 


24-PIN SIP SOCKETS 


ll 


1 


6 


16 


609-NV 


AN-609 


QUANTITIES ARE FOR 1 BOARD 14: 15 OHM RESISTOR 

PARTS FOR WAVE SOLDER: R4 

8: 25 pF CAPACITOR 2 RS 

03 15: 220 OHM RESISTOR 

C4 R6 

; 0.1 pF CAPACITOR 81 Rll 

- ™ R12 

aa R16 

7 | 16: 470 OHM RESISTOR 

che R7 

cg R8 

C17-C92 R10 

10: 25 pF or 22 pF 10 17: 2.7k RESISTOR 
CAPACITOR, 15V R13 
OR GREATER 14 

Cl (NOTE 2) 15 

C2 (NOTE 2) 17 (NOTE 4) 

ClO = (NOTE 2) 18: PUSH BUTTON SWITCH 


Cll (NOTE 2) 


C12 (NOTE 2) sl 

C13. (NOTE 2) 

C14 (NOTE 2) 19: 20 MHz CRYSTAL 
C15 (NOTE 2) YL 

Cl6 = (NOTE 2) 20: 9=PIN CONNECTOR 
C93. (NOTE 2) y1 

ll: LED 1 21: 25-PIN CONNECTOR 
D1 (NOTE 3) 32 

12: 12k RESISTOR 2 22;  7=PIN POWER CONNECTOR 
ie 33 (NOTE 5) 

13: 330 OHM RESISTOR 1 om Saree eee 

a J7 


FIGURE 32. Parts List (Continued) 
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QUANTITIES ARE FOR 1 BOARD 
PARTS FOR WAVE SOLDER: 
24: PAIR OF JUMPER PINS 20 


Wl 
W2 
W3 
W4 


WS (4 PIN SQUARE) 
W5 


W6 (4 PIN SQUARE) 
W6 


W10 
Wil 
W12 
W13 
W14 
W15 
W16 
W17 
W18 
W19 
W20 
W2l 


NOTE 1: DO NOT USE SOCKET FOR SPARE U224 

NOTE 2: POLARITY SHOWN IN THE ASSEMBLY DRAWING 

NOTE 3: ANODE AND CATHODE MARKED "A" and "C" ON SILK SCREEN 
NOTE 4: MOUNT AS SHOWN IN ASSEMBLY DRAWING 

NOTE 5: PLASTIC FLANGE TOWARD C75 

FRC ICR a akc akc a ae ae aa a ae a ae a akc ae ae ae ae ae ak ak ak 


FIGURE 32. Parts List (Continued) 
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609-NV 


AN-609 


PARTS FOR INSERTION: U211 16V8_25 
U225 74L814 Ll U221 16V8_25 
U5 74AS74 2 U208 16V8_12 4 
U6 74AS74 U209 16V8_12 

U220 16V8_12 
. ao va nc : 
Ul3 748157 U1l4 DM87SR193N 2 
U27 MAX232 ql U15 DM87SR193N 

U23 74ALS646 (NOTE 7) 2 
RNL = 22RPAK 1 U24 74ALS646 
U25 74ALS240 1 U200 20V8.12 1 
US 74ALS244 1 U201 20V8_25 
Ull 74ACT244 2 U202 20V8_25 
U29 74ACT244 U203 20V8_25 
ug 74ACT245 10 U205 20V8_25 
U10 74ACT245 U206 20V8_25 
U1l02 74ACT245 U222 20V8_25 
U103 74ACT245 Ul DP8500 1 
ne ties ve bres 
U122 74ACT245 U26 NS16C450V 1 
U123 74ACT245 U1lLOL DP8511V 4 
U132 74ACT245 Ull1l DP8511V 
VU135 74ACT245 U121 DP8511V 
U7 7AACT373 3 U131 DP8511V 
U8 74ACT373 U104 M5M4C264L 120 ns 16 
U16 T4ACT373 U1l05 M5M4C264L 
U19 74ACT373 VU106 M5M4C264L 
U20 74ACT373 U107 M5M4C264L 
U21 74ACT373 U114 M5M4C264L 
U22 74ACT373 U115 M5M4C264L 


U1l16 M5M4C264L 


U4 =: 74ACT374 1 U117 M5M4C264L 


ULS —- 74ALSS74 1 U124 M5M4C264L 
U1l00 74LS195A (NOTE 6) 4 : 3 U125 M5M4C264L 
U110 74LS195A U1l26 MS5M4C264L 
U120 74LS195A U127 M5M4C264L 
U150 74LS195A U154 M5M4C264L 
U204 16V8.25 5 U1S5 MSM4C264L 
U207 16V8_25 U136 M5M4C264L 
U210 16V8_.25 U15S7 M5M4C264L 


NOTE 6: CENTERED IN 20=-PIN SOCKET 
NOTE 7: OR 74AC646 


FIGURE 32. Parts List (Continued) 
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CD 


National © 
Semiconductor 


APPENDIX A: Related Datasheets 


The table below lists a series of other products/functions that may have interest and which are available as datasheets from 


National Semiconductor. 


Product Number 


Product Description 


LH4002 Wideband Video Buffer 

LH4003 Precision RF Closed Loop Buffer 

LH4004 Wideband FET Input Buffer Amplifier 

LH4101 Wideband High Current Operational 
Amplifier 

LM592 Differential Video Amplifier 

LM675 Power Operational Amplifier 

LM1203 RGB Video Amplifier System 

LM1211 Broadband Demodulator SSC129 

LM1578 Switching Regulator 

LM1823 Video IF Amplifier/PLL Detection System 

LM1875 20W Power Audio Amplifier 

LM1881 Video Sync Separator 

LM1884 TV Stereo Decoder 

LM1886 TV Video Matrix D to A 

LM1889 TV Video Modulator 

LM2889 TV Video Modulator 

LM6161 High Speed Operational Amplifiers 

LM6164 High Speed Operational Amplifiers 

LM6165 High Speed Operational Amplifiers 

LMC555 CMOS Timer 

NS32CG16 High-Performance Printer/Display Processer 

APPLICATION NOTES 

Note No. Description 

AN-391 The LM1823—A High Quality TV Video 
IF Amplifier and Synchronous Detector 
for Cable Receivers 

AN-402 LM2889 R.F. Modulator 


V XIGNAddV 


S]eoyse}eg palejoy 


National | 
Semiconductor 


Physical Dimensions 


NS Package Number M16A 
0.150 0.157 
(3.810 — 3.988) 
0.010 —0.020 
10.254 0.508) «48° 
C "ALLLEADS 


= 


0.008 — 0.010 
(0.203 —0.254) 
TYP ALL LEADS 0.004 TYP ALL LEADS 


0.102 
ALL LEAD TIPS 


All dimensions are in inches (millimeters) 


16 Lead (0.150” Wide) Small Outline Molded Package (M) 


0.386 —0.394 
(9.804 — 10.00) 


0.228 —0.244 30° 
(5.791 —6.198) TYP 
LEAD NO.1 
IDENT 0.010 max 
(0.254) 


(1.346 — 1.753) 0.004 — 0.010 


(0.102 — 0.254) 


SEATING 


a h PLANE 


0.014—-0.020 typ 
(0.356 —-0.508) 


(0.203) M1GA (REV H) 


16 Lead Molded Dual In-Line Package (N) 


0.843—-0.870 
— I~ (21.41-—22.10) 


NS Package Number N16A 


PIN NO. 1 IDENT 


95°+5° 0.009-0.015 
(0.229—0.381) 
0.075 +0.015 
+0.040 Eee Tey 
0.325 “n15 (1.905 +0.381) 
+1.016 
(8.255 ay) 


0.280 
(7.112) 0.030 
0300-0320 | (0762) 
(7.620—-8.128) 0.065 0.060 0.130 +0.005 
| (1.651) (1.624) (3.302 +0.127) 


0.250 +0.005 
(6.350 +0.127) 


0.145-0.200 
(3.683-5.080) 


0.100 +0.010 
(2.540 +0.254) 


N16A (REV E) 


4-4 


44 Pin Ceramic Pin Grid Array, Cavity Up 
NS Package Number U44A 


0.790 0.810 
: (20.07-20.57) °" ‘ . (17.48 18.09) a 


INDEX MARK 


0.080 —0.105 
0.175 —0.185 (2.032 — 2.667) 
(4.445 — 4.699) 


0.050 
Me tae (1.270) "YP 
DIA 4 PLCS 
re 270) 0.005 a TYP 
: he Se 0.016—0.020 (0.127) UA4A (REV A) 


*— (0.406—0.508) DA TYP 


20 Lead Plastic Chip Carrier (PCC) 
NS Package Number V20A 


4 SPACES AT 
0.050 
(1.270) lit ai 
19] 2012 |3 A x 45° 
0.080 
DIA NOM 
PEDESTAL 
15°  \VIEWA-A 
Ls ci | 3 
4 SPACES AT 
0.050_ er 0.226. 
(1.270) (5.740) 
NOM 
SQUARE 
0.310 —0.330 
(7.874 — 8.382) 
(CONTACT DIMENSION) 
0.013—0.018 
(0.330 —0.457) 
0.026 —0.032 
(0.660 — 0.813) £020, 
TYP (0.508) 
MIN 
aee8-—0018 ; 0.032 — 0.040 
aac 0.104 —0.118 (0.813 — 1,016) 
(2.642 — 2.997) 0.165 — 0.180 
(4.191 — 4.572) 
PIN NO.1 
IDENT 0.385 —0.395 
(9.779 — 10.03) 


SQUARE V20A (REV J) 


suoisuewig jeaisAud 


Physical Dimensions 


28 Lead Plastic Chip Carrier (PCC) _ _ 
NS Package Number V28A 


6 SPACES AT 


0.050 
(4270) 
0,130 
NO 


VIEW A-A 


(1.143) 
x 45° 
0.410 —0.430 
(10.41 10.92) 
SQUARE | 
n.o2g | (CONTACT DIMENSION) | i 
0.032-0.000 (0.508) > |\(0.330-0.457) 0.165 -0.180 
(0.813—1.016) MIN TYP (4.191 4.572) 
t t 15 
| 0.006 —-0.015 | | , 
(0.127 —0.361) | t 
PIN NO. 1 0.026 —0.032 | 0.104--0.1 
IDENT “1 (0.660 —0.813) guaran 
0.450 | TP 
(11.43) 
REF SQ 
0.485 —-0.495 
(12.32 — 12.67) 
SQUARE V28A (REV G) 
44 Lead Plastic Chip Carrier (PCC) 
NS Package Number V44A | 
| <——-——- 
0.060 - 
(1.270) mabglalgalataly 
7 1 
4 10 SPACES AT 
0.050 = 0.500 
(1.270 = 12.70) 0.045 VIEW A-A 


(1.143) (1.270) (1.143) 
REF 
0.610 —0.630 
<< (15.49- 16.00) 
0.032 —0.040 : SQUARE 0.165 —0.180 
108131016) (0'508) (CONTACT DIMENSION) tor3—0.018 (4.191-4.572) 
MAX MIN a (0.330--0.457) MAX 
pa | TYP } t 


eats 


(0.127 — 0.381) 
MAX 


ilveltadlititatititg) . 


| 0.026 - 0.032 


: (0.660 — 0.813) 
0.650 ; | bs ve 


0.104 — 0,118 


(2.642 —2.997) © 


(16.51) 
REF SQ 

0.685 — 0.695 

(17.40 — 17.65) V44A (REV H) 
SQUARE 


4-6 


68 Lead Plastic Chip Carrier (PCC) 
NS Package Number V68A 


0.080 = 0.800 (0.508) 
(1.270 = 20.32) MIN 0.045. 
0.050 16 SPACES AT 0,104 0.118 | (1.143) 
. : 2.642 —2.997) 
REF 68 1 ( Ls (i 
(ms 
oumt 
y ——T 0.045 
y; (1.143) 
am 
0.013 —0.018 7 0.950 
0.330 —0.457) Sy—+ 
0910-0990 (yp) Dd sealipes 
0,050 = 0.800 | om! REF SQ 
9.060 = 0.800 (23.11 — 23.62) = : 
16 SPACES AT CONTACT At (25.02 — 25.27) 
DIMENSION dt —_t SQUARE 
\ om 
0.050 18° — 
(1.270) — 
REF 1 
a i = 0.026 — 0.032 
26 7 | (0.660—0.813) 
ai ( a 
0.032 —-0.040 | 
(0.813 — 1.016) | 
0.165 —0.180 
0.005 —0.015 (4.191 —4.572) 
(0.127 —0.381) 
V68A (REV G) 
84 Lead Plastic Chip Carrier (PCC) 
NS Package Number V84A 
1.190 +0.005 0.060 
(30.23 + 0.127) (1.824) 
“a MIN 
TYP : 0.104 —-0.118 
(2.642 — 2.997) 
0.020 
(0.508) Ir 
: . 
0.013—0.018 Gor tre ee 
basalt : Te. — 
— 0.085 
= ~| (2.160) 
— NOM 
—— 
- — 
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